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' _| ' I Abstract. We discuss the dynamics and thermodynamics of systems with weak long-range interac- 

'q . tions. Generically, these systems experience a violent collisionless relaxation in the Vlasov regime 

' leading to a (usually) non-Boltzmannian quasi stationary state (QSS), followed by a slow colli- 

I sional relaxation leading to the Boltzmann statistical equilibrium state. These two regimes can be 

" ■ explained by a kinetic theory, using an expansion of the BBGKY hierarchy in powers of 1/N, where 

is the number of particles. We discuss the physical meaning of the different functionals appearing 
in the analysis: the Boltzmann entropy, the Lynden-Bell entropy, the "generalized" entropies arising 
■ in the reduced space of coarse-grained distribution functions, the Tsallis entropy, the generalized 

//-functions increasing during violent relaxation (not necessarily monotonically) and the convex 
Casimir functionals used to settle the formal nonlinear dynamical stability of steady states of the 
Vlasov equation. We show the connection between the different variational problems associated 
I with these functionals. We also introduce a general class of nonlinear mean field Fokker-Planck 

^ ■ (NFP) equations that can be used as numerical algorithms to solve these constrained optimization 

Q ' problems. 

^ ■ 1. INTRODUCTION 

> 

Csl . Recently there was a renewed interest for the dynamics and thermodynamics of systems 

^ I with long-range interactions [1]. These systems are numerous in Nature and appear in 

different domains of physics, astrophysics, fluid mechanics and biology. Systems with 
long-range interactions have very particular properties that contrast from those of more 
^ I familiar systems with short-range interactions like gases and neutral plasmas. First of 

O ■ all, they can organize spontaneously into "coherent structures" in physical space. This 

corresponds to stars, galaxies, clusters of galaxies... in astrophysics, jets and vortices 
k>( I (like the gulf stream or jovian vortices) in two-dimensional geophysical turbulence, 

5^ ■ clumps and filaments (interpreted as the initiation of a vasculature) in the chemotaxis 

of biological populations, clusters in the HMF model... These systems have a very 
special thermodynamic limit corresponding to N ^ +0° in such a way that the coupling 
constant tends to zero (fc ~ 1/A^ ^ 0) while the volume remains unity (V ~ 1). This 
limit describes systems with weak long-range interactions. For these systems, the mean 
field approximation becomes exact for A'^ — > +0°. Generically, these systems display 
two successive types of relaxation: a violent collisionless relaxation leading to a Quasi 
Stationary State (QSS) and a slow coUisional relaxation tending to the Boltzmann 
distribution of statistical equilibrium. The QSS is a stationary solution of the Vlasov 
equation on the coarse-grained scale. It is usually described by a non-Boltzmannian 



distribution and its lifetime diverges with A'^. Mathematically, the Bolzmann equilibrium 
is obtained when the t +00 limit is taken before the A'' +00 limit and the QSS is 
obtained when the A'^ +0° limit is taken before the t +00 limit. These two regimes 
can be explained by a kinetic theory, using an expansion of the BBGKY hierarchy in 
powers of 1/N with — > +00 [0, Hi 0, IH]. Other peculiar behaviors in the dynamics and 
thermodynamics of systems with long-range interactions have been evidenced: negative 
specific heats in the microcanonical ensemble [@, 0, S, @], ensemble inequivalence 



[ lol [TlL II2L | 13ll . numerous types of phase transitions persisting at the thermodynamic 
limit lll4i" 151. non-trivial dependence of the coUisional relaxation time with the number 
N of particles \lJ^, algebraic decay of the correlation functions [jlsL fl9l I20I I2II] . 
front structure and slow relaxation of the velocity tails [|22L l2l|] . metastable states whose 
lifetimes diverge exponentially with A^ |23i l^], out-of-equilibrium phase transitions 125. 
I27I . I28I I29I] . kinetic blocking due to the absence of resonanc es [l2ll [30|]. curious effects 



due to spatial inhomogeneity [31], non-ergodic behaviors [32, 33], glassy dynamics [34] 
etc... Certainly, the physical richness of these systems (Pandora's box) will lead to further 
investigations and applications. 

In this paper, we discuss the meaning of the different functionals appearing in the 
study of the dynamics and thermodynamics of systems with weak long-range interac- 
tions. The Boltzmann entropy describes the statistical equilibrium state resulting from 
a coUisional relaxation. The Lynden-Bell entropy 11351] and the "generalized" entropic 
functionals arising in the reduced space of coarse-grained distributions [|36|] describe 
the QSS resulting from a complete violent coUisionless relaxation. The maximization of 
these entropic functionals rely on an assumption of ergodicity (efficient mixing). Tsallis 
entropies [|37|] have been proposed as an alternative when the system has a non-ergodic 
behaviour so that violent relaxation is incomplete. Incomplete relaxation may also be un- 
derstood by developing a kinetic theory of the process of violent relaxation [ 38ll39l l5|]. 
Generalized //-functions ll40ll are functionals of the coarse-grained distribution function 
that increase during violent relaxation (not necessarily monotonically). Finally, convex 
Casimir functionals are used to settle the formal nonlinear dynamical stability of steady 
states of the Vlasov equation [pTll42|] . Although the above-mentioned functionals may 
have similar mathematical forms, they have very different physical interpretations. 



2. SYSTEMS WITH WEAK LONG-RANGE INTERACTIONS 
2.1. The A/^-body Hamiltonian system 

We consider an isolated system of particles in interaction whose dynamics is fully 
described by the Hamilton equations 

dri dH d\i dH 

dt o\i dt oYi 
1 ^ 

^ '=1 i<i 



where Uij = u{rj — rj) is a binary potential of interaction depending only on the absolute 
distance |r, — rj] between the particles. Basically, the evolution of the A^^-body distribu- 
tion function /V(ri , Vi , r^f, \n, is governed by the Liouville equation 




where 

is the force by unit of mass experienced by particle i due to the interaction with the other 
particles. The Liouville equation is equivalent to the A/^-body Hamiltonian system ([U). 
Now, the basic postulate of statistical mechanics states that: At statistical equilibrium, 
all the microscopic configurations that are accessible (i.e. that have the correct value 
of the energy) are equiprobable. There is no guarantee that the dynamics will lead the 
system to that "uniform" state. This relies upon a hypothesis of ergodicity which may 
not always be fulfilled for complex systems (some regions of the F-phase space could 
be more probable than others). This is why it is important to develop a kinetic theory in 
order to vindicate (or not!) this result. In the sequel, following the wisdom of ordinary 
statistical mechanics, we shall assume that all the accessible microscopic configurations 
are equiprobable at equilibrium (for t +°°) but we must keep in mind the importance 
of this postulate. 

At statistical equilibrium, the A^-body distribution is given by the microcanonical 
distribution expressing the equiprobability of the accessible configurations 

/V(ri,vi,...,r^,vw) = — ^5(£-//(ri,vi,...,ryv,Viv)). (4) 

Since / PNWidridyi = 1, the density of states with energy E is given by 

g{E)= / 5(£-//(ri,vi,...,r^,VAf))]^Jr/Jv/. (5) 

i 

This is the normalization factor of the A^-body distribution function dH). The micro- 
canonical entropy of the system is defined by S{E) = ln^(£') and the microcanonical 
temperature by 15(E) = I /T{E) = dS/dE (we take the Boltzmann constant kB = 1). We 
introduce the reduced probability distributions 

Pj(xi,...,Xj) = I Ppj(xi,...,XN)dXj+i...dxM, (6) 

where x = (r, v). For identical particles, the average density in /i-space is related to the 
one-body distribution function by 



N 

= {Y^m5(r-ri)5(\-Vi)) =NmPi(r,\), 

i=l 



(7) 



and the average value of the energy is 

f v2 
E = {H)=Nm Pi{r,\)—drd\ 

+^N{N - l)m2 Ju{r - r')P2{r, v, r', \')drd\dr'd\' . 



By differentiating the defining relation for Pj and using Eq. (H 
equilibrium BBGKY-like hierarchy for the reduced distributions 12 



(8) 

, we can obtain an 



1 



dPl^ 

<?ri g{E) dE 



g{E)Pj 



j 

E 

i=2 



dPj _ 1 



dri g{E)dE 



g{E)Pj+, 



g{E)Pj 



m\\. 



dXj+i, 

(9) 
(10) 



d\i g{E) dE [' 

This is the counterpart of the equilibrium hierarchy in plasma physics. It is however 
more complex in the present situation because it has been derived in the microcanonical 
ensemble. Since statistical ensembles are generically inequivalent for systems with long- 
range interactions, we must formulate the problem in the microcanonical ensemble 
which is the fundamental one. We note that the terms involving the derivative of the 
density of states can be split in two parts according to 



1 



g{E) dE [ 



g{E)Pj 



(11) 



The terms with the E derivative would not have emerged if we had started from the 
Gibbs canonical distribution [HI. 



2.2. Thermodynamic limit and mean-field approximation 

We define the thermodynamic limit as N +oo in such a way that the normalized 
energy £ = E /{u^N^m^) and the normalized temperature rj = fiNm^u^ are fixed, where 

represents the typical value of the potential of interaction. Since the normalized cou- 
pling constant /3m^M* = r\/N l/N goes to zero for A'^ +0°, we are studying systems 
with weak long-range interactions. For 3D self-gravitating systems {u = — G/|r — r'|, 

= G/R, 77 = pGMm/R, e = ER/GM^), for 2D point vortices = -2^1n|r-r'|, 

= 1, r] = PNy^, e = E/T^) and for the HMF model (u = -^cos(0 - 6'), = k, 
T] = pkN, £ = E/kN^) we recover the dimensionless parameters already introduced 
in the literature. In general, the potential of interaction is written as M(r//) = ku{rij) 
where k is the coupling constant (e.g., G for self-gravitating systems or k for the HMF 



^ Two-dimensional point vortices (with circulation 7) are special because they have no inertia. Their 
kinetic theory, which must take into account this particularity, is developed in [3C(|. 



model). The dynamical time is to ~ R/vtyp ~ R/ \/Nmu^ where the typical velocity 
Vtyp ~ {Nmu^y/^ has been obtained by equating the kinetic energy ~ NrmP' and the 
potential energy ~ N^irp-u^. For a long-range interaction, we have = k/R^ where a 
is less than the dimension d of the system. Then, we get to ~ 7?(2+«-'')/2/ \/kp where 
p ~ M/y is the average density. By a suitable normalization of the parameters, the 
proper thermodynamic limit for systems with long-range interactions is such that the 
coupling constant behaves like ~ ~ 1 /N, while m~l,/3~l,£'/A''~l and V ~ 1. 
This implies that |r| ~ 1, |v| ~ 1, w ~ 1 /N, |F(j ^ z) | ~ 1 /N and ~ 1. For example, 
in the case of self-gravitating systems, this scaling corresponds to G ~ 1 /N while m ~ 1, 
j8 ~ 1, E/A^ ~ 1, y ~ 1, f/) ~ \/y/Gp 1 and in the case of the HMF model, it corre- 
sponds to ~ 1 /N while /3 ~ 1, E/N ~l,y~l,?£)~l/ ^^/kp ~ 1 (for self-gravitating 
systems, we could also consider the dilute limit N +oo with N/R ~ 1 while G ~ 1, 



m~l,/3~l,£'/A/^~l Il43n but in that case we notice that to ~ N). Using another 
normalization of the parameters, the thermodynamic limit is such that the mass of the 
particles behaves like m ~ 1 /N while fc~M*~l,/3~A'^, £'~l,y~l and ~ 1. In 
this scaling, the total mass M ~ Nm is of order unity. For example, in the case of self- 
gravitating systems, this scaling corresponds to zn ~ 1 /N while G~1,/3~A'', £'~1, 
y ~ 1 , ?o ~ 1 / v^Gp ~ 1 and in the case of 2D vortices to 7 ~ l/N, while /3 ~ A'^, £ ~ 1 , 
y ~ 1, ?D ~ l/ft) ~ R^/r ~ 1 (where T = Ny is the total circulation). 
The first two equations of the equilibrium BBGKY-like hierarchy are 

•"''(x,) = -(A'-l)m^ /^f/?P, + #V''2. (12) 



^ri J dr\ \ dE 



We note that the ratio of dPj/dE on fiPj is of order l/(£/3) = l/(e77A^). Therefore, 
in the thermodynamic limit — +0° with £, r\ fixed, the second term in Eq. (fTTI) is 
always negligible with respect to the first. We now decompose the two- and three-body 
distribution functions in the suggestive form 

P2(xi,X2)=/'l(xi)Pi(x2)+P^(xi,X2), (14) 



P^ixi , X2, X3) = Pi {xi)Px {X2)PI (X3) + P2{xu X2)Pl (X3) 
+P^(xi,X3)Pl(x2) +P^(x2,X3)/'i(xi) + (xj , X2, X3) . (15) 

This decomposition corresponds to the first terms of the Mayer expansion in plasma 
physics. The non-trivial correlations Pj are called the cumulants. We can now substitute 
these decompositions in the equilibrium BBGKY-like hierarchy. The first equation (fT2l) 
of the hierarchy can be written 

^(Xi) =-/3(A^-l)zn2pi(xi) / Pi(x2)^JX2 

-l5{N-lWlp^ixuX2)^dx2-iN-lWl^^{xuX2)dx2. (16) 



In the thermodynamic limit defined previously, it can be shown that the cumulants 
are of order A'^^*^"^^) [|2D. In particular, we have 

P2(xi,X2)=/'l(xi)/'i(x2) + 0(l/iV). (17) 

Therefore, the mean field approximation is exact for N +0°. In the thermodynamic 
limit, the two-body correlation function is the product of two one-body distribution 
functions: i'2(xi,X2) ~ i'i(xi)Pi(x2). 



2.3. The mean field equifibrium distribution 

Taking the limit +00 and using Eq. (flTl) . the first equation (fT6l) of the equilibrium 
BBGKY-like hierarchy becomes 

Vp(r) = -[5mp{r)V I p{r')u{r-r')dr', (18) 

where p (r) = NmPi (r) is the spatial density. After integration, this can be written in the 
form of a meanfield Boltzmann distribution 

p(r) =AV^'"*W, (19) 

where 



4)(r) = j p(r>(r-rVr', 



(20) 



is the potential produced self-consistently by the smooth distribution of particles. There- 
fore, the equilibrium density profile of the particles is determined by an integrodijferen- 
tial equation or ([T9l)-® . On the other hand, for ^ -foo, Eq. (dO]) leads to 



^ = -/3mi'iVi, (21) 
so that the velocity distribution (^(v) = NmP\ (v) is maxwellian 

^{y)=A!'e-^"''^. (22) 

Combining Eqs. ( fT9l ) and (l22l) and introducing the distribution function /(r, v) = 
NmP\{r,y), we get the mean-field Maxwell-Boltzmann distribution function 

/(r,v)=Ae-^'"(^+*W), (23) 
where 4>(r) is given by Eq. (I20l) and p = / fdy. If we define the entropy by 

Sn = - /VlnPjvJxi...JxAf, (24) 



and use the mean-field approximation 

Pn{xi,...,xn)=Pi{xi)...PiM, (25) 
valid at the thermodynamic limit +0° defined previously, we obtain 

S = -N J Pi{x)\nPi{x)dx. (26) 

In terms of the distribution function / = NmPi , we get 

S=- [ -\n( —]drdy. (27) 



On the other hand, in the mean-field approximation, the average energy ^ is given by 

E^Nm J Pi ^—drdy + ]^N^r7? j Pi {r)u{r - y')Pi {r')dYdr' . (28) 
This can also be written 

E = j /^JrJv-f ^ j p(r)4>(r)Jr. (29) 

The mass is given by 

M = j fdrd\. (30) 

Using these relations, we can relate the inverse temperature /3 and the constant A in the 
mean field Maxwell-Boltzmann distribution (|23l) to the energy E and the mass M. 



2.4. Statistical equilibrium state: the Boltzmann entropy 

We wish to determine the most probable distribution of particles at statistical equilib- 
rium by using a combinatorial analysis, assuming that all accessible microstates (with 
given energy E and mass M) are equiprobable. To that purpose, we divide the /i-space 
{r, v} into a very large number of microcells with size h. We do not put any exclusion, 
so that a microcell can be occupied by an arbitrary number of particles. We shall now 
group these microcells into macrocells each of which contains many microcells but re- 
mains nevertheless small compared to the phase-space extension of the whole system. 
We call V the number of microcells in a macrocell. Consider the configuration {n,} 
where n,- is the number of particles in the macrocell i. Using the standard combinatorial 
procedure introduced by Boltzmann, the probability of the state {n,}, i.e. the number of 
microstates corresponding to the macrostate {n,}, is given by 



(31) 



This is the Maxwell-Boltzmann statistics. As is customary, we define the entropy of the 
state {n,} by 

S{{m})=\nW{{m}). (32) 

It is convenient here to return to a representation in terms of the distribution function 
giving the phase-space density in the i-th macrocell: // = /(r,-, v,) — nim/vh'^ (where d 
is the dimension of space). Using the Stirling formula Inn! ~ nlnn — n for n ^ 1, we 
have 

\nW{{ni}) = -Ynilnni = -VvH^-Xn-. (33) 

. -mm 

I I 

Passing to the continuum limit v ^ 0, we obtain the usual expression of the Boltzmann 
entropy 

SB[f]=- [ Un^drdy, (34) 
,/ m m 

up to some unimportant additive constant. Assuming ergodicity, the statistical equi- 
librium state, corresponding to the most probable distribution of particles (i.e. the 
macrostate that is the most represented at the microscopic level), is obtained by max- 
imizing the Boltzmann entropy (l34l) while conserving the total mass (l30l) and the total 
energy (|29l ). Introducing Lagrange multipliers and writing the variational principle in 
the form 

dSB-lidE-a5M = 0, (35) 
we obtain the meanfield Maxwell-Boltzmann distribution 

y = (36) 

This returns the equations obtained in Sec. 2.3. The potential 4> is obtained by solving 
the integrodifferential equation (fT9l)-(l20l) and the Lagrange multipliers A and /3 must be 
related to the constraints M and E using Eqs. (I30l)-(l29l). Then, we have to make sure that 
the distribution is a maximum of Sg at fixed mass and energy, not a minimum or a saddle 
point. Therefore, this method gives a condition of thermodynamical stability which was 
not obtained in Sec. 2.3 (to get a similar relation, we need to consider the next order 
term in the expansion in \/N oi the equilibrium BBGKY-like hierarchy). 
The maximization problem 

max | £[/]=£, M[/] = M}, (37) 

corresponds to a condition of microcanonical stability. Alternatively, the minimization 
problem 

mm {FB[f]=E[f]-TSB[f] \ M[f]=M], (38) 

corresponds to a condition of canonical stability, where is the Boltzmann free en- 
ergy. The optimization problems (l37l) and (l38l) have the same critical points (cancelling 
the first order variations of the thermodynamical potential). On the other hand, canoni- 
cal stability (l38l) implies microcanonical stability (l37l) : if /(r, v) is a minimum of FB[f] 



at fixed mass, then it is a maximum of at fixed mass and energy. However, the 
converse is wrong in case of ensemble inequivalence. This means that the ensemble of 
solutions of (1381) is included in the ensemble of solutions of ([37l) but the two ensembles 
may not coincide for systems with long-range interactions q Ensemble inequivalence 
was first encountered in astrophysics [@, jv]] where it was realized that configurations 
with negative specific heats are allowed in the microcanonical ensemble but not in the 
canonical ensemble (see reviews in |T3, 15]). This notion of ensemble inequivalence has 
been developed and formalized recently at a general level [Hi [3 The condition 
of ensemble equivalence or ensemble inequivalence, related to the concavity of the en- 
tropy S{E), is discussed in detail in these papers. Ensemble equivalence/inequivalence 
can also be deduced from the study of the series of equilibria /3(£') using the Poincare 
criterion Hfisll. 



3. KINETIC THEORY FROM THE BBGKY HIERARCHY 

In this section, we derive a general kinetic equation ( |53l) for Hamiltonian systems with 
weak long-range interactions. We start from the BBGKY hierarchy and use a systematic 
expansion of the solutions of the equations of this hierarchy in powers of 1 /A^ in a proper 
thermodynamic limit A'' +oo. The kinetic equation (l53l) is valid at order 0{\/N). 



3.1. The l/N expansion 



From the Liouville equation ^ we can construct the complete BBGKY hierarchy for 
the reduced distribution functions It reads 




(39) 

This hierarchy of equations is not closed since the equation for the one-body distribution 
/'i(xi,f) involves the two-body distribution P2(xi,X2,^), the equation for the two-body 
distribution /'2(xi,X2,?) involves the three-body distribution /'3(xi,X2,X3,?), and so on... 
The idea is to close the hierarchy by using a systematic expansion of the solutions in 
powers of \/N in the thermodynamic limit +oo. Considering the scaling of the 
terms in each equation of the hierarchy, we argue that there exists solutions of the whole 
BBGKY hierarchy such that the correlation functions Pj scale like \/N^~^ at any time 



Since energy is non-additive for systems with long-range interactions, the canonical ensemble cannot be 
used to study a subpart of the system fTol llll. Therefore, the problem ( [38] l has no physical justification for 
an isolated Hamiltonian system interacting via purely long-range forces. However, the canonical ensemble 
is justified for dissipative systems, such as Brownian systems with long-range interactions, that experience 
a friction force and a stochastic force (modelling short-range interactions) in addition to long-range forces 
Here, we shall only consider isolated Hamiltonian systems so that the fundamental ensemble is the 
microcanonical ensemble. However, the minimization problem (|38] | can be useful mathematically since it 
offers a sufficient condition of microcanonical stability (since ( [38] ) =^ ( [37] l) as discussed in the text. 



i^. This implicitly assumes that the initial condition has no correlation, or that the initial 
correlations respect this scaling (if there are strong correlations in the initial state, like 
"binaries", the kinetic theory will be different from the one developed in the sequel). If 
this scaling is satisfied, we can consider an expansion of the solutions of the equations 
of the hierarchy in terms of the small parameter l/N. This is similar to the expansion 
in terms of the plasma parameter made in plasma physics. However, in plasma physics 
the systems are spatially homogeneous while, in the present case, we shall take into 
account spatial inhomogeneity. This brings additional terms in the kinetic equations 
that are absent in plasma physics. Therefore, strictly speaking, the hierarchy that we 
consider is different from the ordinary BBGKY hierarchy. If we introduce the notations 
/ — NmPi (distribution function) and g = N^P2 (two-body correlation function), we get 
at order 1/A^ [£]: 

dfi , dfi , N-\,^, df, 



dt 



, dg 1 dfi 

3-+V1 — + F 1 — + 2-> 1 /23— 

+J_ /F(3^1)g(x2,X3,0-^X3 + (1^2)=0, (41) 
dvi J m 

where we have introduced the abbreviations /i = /(ri,vi,?) and /2 = /(r2,V2,0- We 
have also introduced the mean force (by unit of mass) created in ri by all the particles 

(F) 1 = / F(2 ^ 1 ) ^ Jr2 JV2 = - V4>i , (42) 
J m 

and the fluctuating force (by unit of mass) created by particle 2 on particle 1 : 

^(2^1)=F(2^1)-l(F)i. (43) 

These equations are exact at the order 0{l/N) where the three-body correlation function 
can be neglected. They form therefore the right basis to develop a kinetic theory for 
Hamiltonian systems with weak long-range interactions. We note that these equations 
are similar to the BBGKY hierarchy of plasma physics but not identical. One difference 
is the {N —\)/N term in Eq. (|40|) . The other difference is the presence of the fluctuating 
force ^{2 1) instead of F{2 1) due to the spatial inhomogeneity of the system. 
In plasma physics, the system is homogeneous over distances of the order of the Debye 
length so the mean force (F) vanishes. 



3.2. The limit N +0°: the Vlasov equation 

Recalling that ~ 1 /N, we note that 

/'2(X1 , X2, = A (Xl , 0^1 (X2, + 0{ 1 /N) . 



(44) 



If we consider the limit N — > +00 (for a fixed time t), we see that the correlations between 
particles can be neglected so the two-body distribution function factorizes in two one- 
body distribution functions i.e. /'2(xi,X2,?) = Pi{xij)Pi{x2,t). Therefore the mean field 
approximation is exact in the limit N +00. Substituting this result in Eq. (|39l) . we 
obtain the Vlasov equation 

^ + v,|^ + (r),|A = 0. (45) 
dt dr\ dy\ 

This equation also results from Eq. (|40l) if we neglect the correlation function g = N^P!^ 
in the r.h.s. The Vlasov equation describes the collisionless evolution of the system up to 
a time at least of order Nt^, (where t^, is the dynamical time). In practice, N ^ I that 
the domain of validity of the Vlasov equation is huge (for example, in stellar systems 
N ~ 10^ — 10^^ stars). When the Vlasov equation is coupled to a long-range potential 
of interaction it can develop a process of phase mixing and violent relaxation leading to 
a quasi-stationary state (QSS) on a very short timescale, of the order of the dynamical 
time to fi35i1 . This process will be discussed in Sec. 4. 



3.3. The order 0{l/N):a general kinetic equation 



If we want to describe the coUisional evolution of the system, we need to consider 
finite A'^ effects. Equations (l40l) and (|4T]) describe the evolution of the system on a 
timescale of order Nto- The equation for the evolution of the smooth distribution func- 
tion is of the form 



dfi dfi N 

■ + yi- — \- 



dt 



N 



dvi 



■■CN[fi 



(46) 



where Cn is a "collision" term analogous to the one arising in the Boltzmann equation. 
In the present context, there are not real collisions between particles. The term on the 
right hand side of Eq. (|46|) is due to the development of correlations between particles 
as time goes on. It is related to the two-body correlation function g{xi,X2,t) which is 
itself related to the distribution function /(xi,f) by Eq. (|4T]) . Our aim is to obtain an 
expression for the collision term Ca^[/] at the order l/N. The difficulty with Eq. ((4T]) for 
the two-body correlation function is that it is an integrodifferential equation. The second 
and third terms are advective terms, the fourth term is the source of the correlation and 
the fifth term takes into account collective effects. If we neglect the contribution of the 
integral in Eq. (|4T]) . we get the simpler coupled system 



dfi dfi N 



dt 



N 



-m 



F(2 l)^(xi,X2)JX2, 



(47) 



dg 
dt 



+ 



+ 



d d d d 

Vi— +V2— + (F)i— + (F)2 



^ri dr2 



d\i 



d\2 
m m 



0. 



(48) 



The integral that we have neglected contains "collective effects" that describe the polar- 
ization of the medium. In plasma physics, they are responsible for the Debye shielding. 
These collective effects are taken into account in the Lenard-Balescu equation through 
the dielectric function psl) . However, this equation is restricted to spatially homoge- 
neous systems and based on a Markovian approximation (see, e.g., [HI]). These assump- 
tions are necessary to use Laplace-Fourier transforms in order to solve the integro- 
differential equation (|4T1) . Here, we want to describe more general situations where the 
interaction is not shielded so that the system can be spatially inhomogeneous. If we ne- 
glect collective effects, we can obtain a general kinetic equation in a closed form (l53l) 
that is valid for systems that are not necessarily homogeneous and that can take into ac- 
count memory effects. This equation has interest in its own right (despite its limitations) 
because its structure bears a lot of physical significance. 

The equation (1481) for the correlation function can be written 



dt 



+ 



^(X|,<)-(X2,<), (49) 

m m 



where we have denoted the advective term by =Sf (Liouvillian operator). Solving for- 
mally this equation with the Green function 



G(f,0 =exp|- j^'^(T)JT|, 



we obtain 



g(xi,X2,/') = - / dxG{t,t-' 
Jo 



(50) 



f f 
^(xi,r-T)-(x2,r 
m m 



The Green function constructed with the smooth field (F) means that, in order to evaluate 
the time integral in Eq. (|5T1) . we must move the coordinates r, (? — t) and v/(f — t) of the 
particles with the mean field flow in phase space, adopting a Lagrangian point of view. 
Thus, in evaluating the time integral, the coordinates r, and V; placed after the Greenian 
must be viewed as r,(? — t) and \i{t — t) where 



rKO- fyi{t-s)ds, v,-(?-t) =v,(0- f{^){r,{t-s),t-s)ds. (52) 

Substituting Eq. ([SB in Eq. (|47]), we get 

d 



dt 



N 



(9vi 



(^vj* Jo 



dz dr2d\2F^i2^1,t)G{t,t-x) 



^'(2-^l)^ + ^'(1^2)^ 



f 

/(ri,vi,?-T)^(r2,V2,?-T). (53) 
m 



This kinetic equation can also be obtained from a more abstract projection operator 
formalism [46] or from a quasilinear theory based on the Klimontovich equation [[sj] 



(note that we can replace F^{2 ^ l,t) by ^^{2 — > in the first term of the r.h.s. 
of the equation since the fluctuations vanish in average). This kinetic equation (|53] ) is 
valid at order 1/A^ so that it describes the "coUisional" evolution of the system on a 
timescale of order Nto (ignoring collective effects). Equation (l53l) is a non-Markovian 
integrodifferential equation. If we implement a Markovian approximation /(ri,vi,? — 
t) ~ /(ri,vi,?), /(r2,V2,? — t) ~ /(r2,V2,0 and extend the time integral to -\-°°, we 
obtain 



X 



/(ri,v,,<)-(r2,V2,<). (54) 

m 



The Markov approximation is justified for A'^ ^ +00 because the timescale Mo on which 
/ changes is long compared with the timescale Tcorr for which the integrand in Eq. 
(|54l) has significant support (except for self-gravitating systems for which the force 



auto-correlation function decreases like \/t [|47|] ). We do not assume, however, that the 
decorrelation time is "extremely" short (i.e, Xcon- 0)- Therefore, in the time integral, 
the distribution functions must be evaluated at {r{t — t) , v(? — t) ) and (ri [t — x),y\{t — 
t)) where 



Ti{t-T)=ri{t)- I \i{t~s)ds, \i{t-x)=\i{t)- I {^){Yi{t-s),t)ds. 
Jo Jo 



(55) 



Comparing Eq. (l55l) with Eq. (|52|) . we have assumed that the mean force (F)(r,f) does 
not change substantially on the timescale Tcorr on which the time integral has essential 
contributions. 



3.4. Properties of the general kinetic equation 

When the system is spatially inhomogeneous, its coUisional evolution can be very 
complicated and very little is known concerning kinetic equations of the form (l53l) . For 
example, it is not straightforward to prove by a direct calculation that Eq. (l53l) conserves 
the energy. However, since Eq. (1531) is exact at order 0{l/N), the energy must be con- 
served. Indeed, the integral constraints of the Hamiltonian system must be conserved at 
any order of the 1/A'' expansion!!. On the other hand, we cannot establish the //-theorem 
for an equation of the form (l53l) . It is only when additional approximations are imple- 
mented (markovian approximation) that the //-theorem is obtained. To be more precise, 
let us compute the rate of change of the Boltzmann entropy 5s = — /^In^JriJvi 



As we have already indicated, Eq. ( |53] ) ignores collective effects. However, we do not think that this 
approximation alters the conservation of the energy. When we make a Markovian approximation (valid 
for +00) and extend the time integral to infinity, leading to Eq. ( |54l ). the conservation of the energy is 
shown in Sec. 3.5 for homogeneous systems and in ll48ll for inhomogeneous systems (using angle-action 
variables). 



with respect to the general kinetic equation (153]) . After straightforward manipulations 
obtained by interchanging the indices 1 and 2, it can be put in the form 



dx\dx2 



1 



/1/2 Jo 
xG{t,t-T 



dz 



^^(2^1)/2^ + ^^(1^2)/i 



^^(2^1)/2|4 + ^'(1 



2)/i 



(56) 



We note that its sign is not necessarily positive. This depends on the importance of 
memory effects. In general, the Markovian approximation is justified for A'^ +00 
because the correlations decay on a timescale Xcorr that is much smaller than the time 
~ Nto on which the distribution changes (with, again, the exception of gravity). In that 
case, the entropy increases monotonically as shown in Sec. 3.5 for homogeneous systems 
and in ll48ll for inhomogeneous systems (using angle-action variables). Now, even if the 
energy is conserved and the entropy increases monotonically, it is not completely clear 
whether the general kinetic equation (l53l) will relax towards the mean field Maxwell- 
Boltzmann distribution (l36l) of statistical equilibrium. It could be trapped in a steady 
state that is not the state of maximal entropy because there is no resonance anymore to 
drive the relaxation (see the discussion in Sees. 3.5 and 3.7 and in HSN ). Indeed, the 
kinetic equation (l53l) can admit several stationary solutions, not only the Boltzmann 
distribution. It could also undergo everlasting oscillations without reaching a steady 
state. The kinetic equation (l53l) may have a rich variety of behaviors and its complete 
study is of great complexity. 



3.5. Spatially homogeneous systems: the Landau and Lenard-Balescu 

equations 

If we consider spatially homogeneous systems, make a Markovian approximation and 
extend the time integration to infinity, the kinetic equation (1531) becomes 



dt dv 



Jt / Jr2Jv2F^(2^1,0i^^(2^1,f-T) 
dv^ Jo J 



x(^-4^)fiyut)^{y2j), (57) 

\ov[ 0V2J m 

where we have used F(2 ^ 1) = — F(l 2). Since (F) = 0, the particles follow linear 
trajectories with constant velocity to leading order in N. Therefore, v, (? — t) = v, and 
ri{t — t) = r,- — v,T where r, and v,- denote their position and velocity at time t. In that 
case, the integrals on time and position in Eq. ( 1571) can be easily performed in Fourier 
space iH] and we obtain the Landau equation 

^ = ;r(2;r)^m^|jv2Jkfc^F^(fc)25(k.w)(/2|^-/i|^), (58) 



where w = Vi — V2. Other equivalent expressions of the Landau equation are given in 
Jsl]. The Landau equation ignores collective effects. Collective effects can be taken 
into account by keeping the contribution of the last integral in Eq. (|4TI) . For spatially 
homogeneous systems, the calculations can be carried out explicitly in the complex plane 



Din and lead to the Lenard-Balescu equation 

where e(k, co) is the dielectric function 

e(k, (0) = 1 - (27tYu(k) I ^-^^-^dy. (60) 

J k • V — ft) 

We note that the Lenard-Balescu equation is obtained from the Landau equation by 
replacing the potential u{k) by the "screened" potential M(fc)/|e(k, k- V2)|. The Landau 
and Lenard-Balescu equations conserve mass and energy (reducing to the kinetic energy 
for a spatially homogemous system) and monotonically increase the Boltzmann entropy 



[1450. Thus, M = E = and Sb > (//-theorem). The coUisional evolution is due to 
a condition of resonance between the particles' orbits. For homogeneous systems, the 
condition of resonance encapsulated in the 5-function appearing in the Landau and 
Lenard-Balescu equations corresponds to k ■ vi = k ■ V2 with vi ^ \2- For d > I the 
only stationary solution is the Maxwell distribution. Therefore, the Landau and Lenard- 
Balescu equations relax towards the Maxwell distribution. Since the collision term in 
Eq. (l53l) is valid at order 0{l/N), the relaxation time scales like 

tRr^NtD, {d>\) (61) 

as can be seen directly from Eq. (|58] ) by dimensional analysis (comparing the l.h.s. and 
the r.h.s., we have 1/?^ ~ uIn ~ 1 /N while^o ^ R/vtyp ~ 1 with the scalings introduced 
in Sec. 2.2) . This scaling, predicted in yO, has been observed for 2D Coulombian 
plasmas HIH. 



For one-dimensional systems, like the HMF model, the situation is different. For 
d = I, the kinetic equation (l58l) reduces to 

-^^2nm—J dv2dk-j^^^^,5{v, - V2) (/2^ - /i ^ j = 0. (62) 

Therefore, the collision term C/^f/] vanishes at the order 1/A^ because there is no 
resonance. The kinetic equation reduces to df/ dt = so that the distribution function 
does not evolve at all on a timescale ~ Nto- This implies that, for one-dimensional 
homogeneous systems, the relaxation time towards statistical equilibrium is larger than 
NtD- Thus, we expect that 

tR>NtD, {d=\). (63) 

For the HMF model, when the system remains spatially homogeneous, it is found that the 
relaxation time scales like Ir ~ [|l7|] . The fact that the Lenard-Balescu collision term 
vanishes in ID is known for a long time in plasma physics (see, e.g., the last paragraph 



in [|52D) and has been rediscovered recently in the context of the HMF model [|20L I53I 



3.6. The Vlasov-Landau equation for stellar systems 



Self-gravitating systems are spatially inhomogeneous, but the coUisional current can 
be calculated by making a local approximation [|54|] . Therefore, the evolution of the 
distribution function is governed by the Vlasov-Landau equation 

+ Vi-3- + (F)i- — = -T^ /i^^ Wv2, (64) 



dt ' dri ' " ^vi dv^J \ dv\ ' dv^y 

K^" = 2nmG^-\nA ( - , (65) 

where InA = jj^"'"' dk/k is the Coulomb factor (regularized with appropriate cut-offs) 
and we have set /i = f(r],yut) an d f? = /(ri,V2,0 assuming that the encounters can 



ana we nave sei ji = J{ri,\ui) an a t? = /(^ri, V2,rj assummg mai me encounters can 
be treated as local (see, e.g., ll46l[55l . [56[ |4ll for a more complete discussion). The Vlasov- 
Landau-Poisson system conserves the total mass and the total energy (kinetic + poten- 
tial) of the system. It also increases the Boltzmann entropy ( |34l) monotonically: 5g > 
(H-theorem). The mean field Maxwell-Boltzmann distribution (l36l) is the only stationary 
solution of this equation (cancelling both the advective term and the collision term in- 
dividually). Therefore, the system tends to reach the Boltzmann distribution. However, 
there are two reasons why it cannot attain it: (i) Evaporation: when coupled to the grav- 
itational Poisson equation, the mean field Maxwell-Boltzmann distribution (l36l) yields 
a density profile p ~ r^^ (for r — * +0°) with infinite mass so there is no physical dis- 
tribution of the form (l36l) in an infinite domain ifloL [isll . The system can increase the 
Boltzmann entropy indefinitely by evaporating. Therefore, the Vlasov-Landau-Poisson 
system has no steady state with finite mass and the density profile tends to spread in- 
definitely, (ii) Gravothermal catastrophe: if the energy of the system is lower than 
the Antonov threshold Ec = —0335GM^/R (where R is the system size), it will un- 
dergo core collapse. This is called gravothermal catastrophe [6] because the system 
can increase the Boltzmann entropy indefinitely by contracting and overheating. This 
proc ess usually dominates over evaporation and leads to the formation of binary stars 



[|54l I57L Il5|l . If the system is confined within a box so as to prevent evaporation, the 
Vlasov-Landau-Poisson system admits stationary solutions for sufficiently large ener- 
gies (above the Antonov threshold). They correspond to the mean field Boltzmann dis- 
tributions (l36l) . There can exist several Boltzmann distributions with the same values of 
mass and energy having different density constrasts ^ = p{0)/p{R). Since the Boltz- 
mann entropy is the Lyapunov functional of the Vlasov-Landau equation, a Boltzmann 
distribution (steady state) is linearly dynamically stable iff it is a (local) maximum of 
entropy at fixed mass and energy (thermodynamical stability). Therefore, only the local 
entropy maxima at fixed mass and energy can be reached in practice. This corresponds 
to configurations with density contrast M < 709. Minima and saddle points of entropy 
> 709) are unstable. The relaxation time to the Boltzmann distribution is 

~ r^j^D, (66) 

mN 

where the logarithmic correction comes from the divergence of the Coulombian factor 
InA ~ InA^. Since there is no global entropy maximum at fixed mass and energy for a 



self-gravitating system confined within a box ifloL flsll. the equilibrium configurations are 
only metastable (local entropy maxima). However, the lifetime of the metastable states 
is considerable, scaling like e^, so that these states are stable in practice ll24ll . Depending 
on the initial condition and on the structure of the basin of attraction, the system 
can either relax towards a long-lived metastable mean field Boltzmann distribution 
or undergo gravitational collapse (gravothermal catastrophe) and form a binary star 
surrounded by a hot halo (leading to a structure with unbounded entropy). 



3.7. Orbit-averaged kinetic equation 

We have seen in Sec. 3.5 that one dimensional systems that are spatially homogeneous 
do not evolve at all on a timescale ~ Nto or larger because of the absence of resonances. 
However, if the system is spatially inhomogeneous, new resonances can appear as 



described in [481 ] so that an evolution is possible on a timescale Nto- Then, we can expect 
that one dimensional inhomogeneous systems will tend to approach the Boltzmann 
distribution on the timescale Nto- To be more precise, let us consider the orbit-averaged- 
Fokker- Planck equation derived in Exploiting the timescale separation between 
the dynamical time and the relaxation time, we can average Eq. (I54l) over the orbits, 
assuming that at any stage of its evolution the system reaches a mechanical equilibrium 
on a short dynamical time. Therefore, the distribution function is a quasi-stationary 
solution of the Vlasov equation / ^ /(e, [where £ = v^/2 + 4> is the individual energy] 
slowly evolving in time under the effect of "collisions" (= correlations due to finite N 



effects). Introducing angle-action variables, we get an equation of the form |148| 
% = \§jl^j MA,n,n'{JJ')\^S{ma{J) -m'a{f)) !f(j')m^-f{J)m'^\dJ'. 

m,m' ^ ^ 

(67) 

The important point to notice is that the evolution of the system is due to a condition of 
resonance between the pulsations Q.{J) of the particles' orbits (this property extends to 
d dimensions). Only particles whose pulsations satisfy mQ.{J) = m'Q.{J') with (m,7) ^ 
{m' ,J') participate to the diffusion current. This is similar to the coUisional relaxation 



of two dimensional point vortices 11211 . |30|] . It can be shown that Eq. (|67]) conserves 
mass and energy and monotonically increases the Boltzmann entropy so that the system 
tends to approach the Boltzmann distribution of statistical equilibrium on a timescale 



~ Nto (the Boltzmann distribution is always a stationary solution of Eq. (|67]) ) ||48|] . 
However, it may happen that there is not enough resonances so that the system can 
be trapped in a quasi stationary state dijferent from the Boltzmann distribution. This 
happens when the condition of resonance cannot be satisfied so that mQ.{J) ^ m'Q.{J') 
for all (m,7) 7^ {m' ,J') for which |A„„,/(7,y') p 7^ 0. In that case, the system is in a steady 
state of Eq. (|67]) that is not the Boltzmann distribution. Indeed, Eq. (|67]) may admit 



other stationary solutions than the Boltzmann distribution [481]. This is what happens to 
point vortices in 2D hydrodynamics: the coUisional relaxation stops when the profile of 
angular velocity becomes monotonic even if the system has not reached the Boltzmann 
distribution [21]. In that case, the system will relax towards the Boltzmann distribution 
(if it truly does!) on a timescale larger than Nto (this requires to develop the kinetic 



theory at higher order in \/N, taking into account three body, four body,... correlations). 
We may wonder whether the same situation can happen to systems described by a kinetic 
equation of the form (|53l) . 



4. STATISTICAL THEORY OF VIOLENT RELAXATION: THE 

LYNDEN-BELL ENTROPY 

For t <^ Nto, the evolution of the system is governed by the Vlasov equation 

^ + --|^ + F-^ = 0' (68) 
dt dr d\ 

where F = — V4> is the force by unit of mass experienced by a particle. It is produced 
self-consistently by the other particles through the mean field 



4>(r,0 = j u{r-r')p{r',t)dr'. 



(69) 



Mathematically, the Vlasov equation is obtained when the A'^ +oo limit is taken before 
the t +00 limit. Indeed, the collision term in Eq. (1531) scales as l/N so that it vanishes 
for N — > +00. The Vlasov equation, or coUisionless Boltzmann equation, simply states 
that, in the absence of "collisions" (more properly correlations), the distribution function 
/ is conserved by the flow in phase space. This can be written df /dt — where 
d/dt = d /dt + \ ■ d /dr — V4> ■ d /d\ is the material derivative. The Vlasov equation 
conserves the usual constraints: the mass M and the energy E, but also an infinite 
number of invariants called the Casimirs. They are defined by 4 = / h{f)drd\ for 
any continuous function h{f). The conservation of the Casimirs is equivalent to the 
conservation of the moments of the distribution function denoted 

M„= [ fdrdy. (70) 



The Vlasov equation admits an infinite^number of stationary solutions whose gen- 
eral form is given by the Jeans theorem i54\ . Starting from an unstable initial condition 
/o(r,v), the Vlasov equation develops very complex filaments as a result of a mixing 
process in phase space (coUisionless mixing). In this sense, the fine-grained distribution 
function /(r, v, t) will never reach a stationary state but will rather produce intermingled 
filaments at smaller and smaller scales. However, if we introduce a coarse-graining pro- 
cedure, the coarse-grained distribution function /(r,v,?) will reach a metaequilibrium 
state /(r,v) on a very short timescale, of the order of the dynamical time to- This is 
because the evolution continues at scales smaller than the scale of observation (coarse- 
grained). This process is known as "phase mixing" and "violent relaxation" (or collision- 
less relaxation) [54]. Then, on a longer timescale, collisions (correlations) will come into 
play and drive the system towards the Boltzmann statistical equilibrium state. Lynden- 
Bell [[35I1 has tried to predict the metaequilibrium state, or quasi stationary state (QSS), 
achieved by the system through violent relaxation in terms of statistical mechanics. This 
approach is of course quite distinct from the statistical mechanics of the A^-body system 



(exposed in Sec. 2.4) which describes the statistical equilibrium state reached by a dis- 
crete A''-body Hamiltonian system for t +00. In Lynden-Bell's approach, we make the 
statistical mechanics of di field, the distribution function /(r, v, t) whose evolution is gov- 
erned by the Vlasov equation (l68l) while in Sec. 2.4 we made the statistical mechanics 
of a system of point particles described by the Hamilton equations ©. 

Let /o(r,v) denote the initial (fine-grained) distribution function (DF). We discretize 
/o(r, v) in a series of levels 77 on which /o(r, v) ~ 77 is approximately constant. Thus, 
the levels {77} represent all the values taken by the fine-grained DF. If the initial 
condition is unstable, the distribution function /(r, v,f) will be stirred in phase space 
(phase mixing) but will conserve its values r\ and the corresponding hypervolumes 
7(7] ) = / 5 (/(r, V, ? ) — r})drd\ as a property of the Vlasov equation (this is equivalent to 
the conservation of the Casimirs). Let us introduce the probability density p(r, v, 7/) of 
finding the level of phase density 77 in a small neighborhood of the position r, v in phase 
space. This probability density can be viewed as the local area proportion occupied by 
the phase level r] and it must satisfy at each point the normalization condition 

p(r,v,77)j7] = 1. (71) 

The locally averaged (coarse-grained) DF is then expressed in terms of the probability 
density as 

7(r,v) = j p(r,v,7])7]j7], (72) 
and the associated macroscopic potential satisfies 

^{r,t) = j u{r-r')f{r'y,t)dr'dv'. (73) 

Since the potential is expressed by space integrals of the density, it smoothes out the 
fluctuations of the distribution function, supposed at very fine scale, so ^> has negligible 
fluctuations (we thus drop the bar on ^»). The conserved quantities of the Vlasov equation 
can be decomposed in two groups. The mass and energy will be called robust integrals 
because they are conserved by the coarse-grained distribution function: M[f] — M\f\ 

M= [fdrdy, (74) 



and E[f] ^ £ [/] . Hence 



-fv^drdv + 2 / / ^drd\. (75) 

As discussed above, the potential can be considered as smooth, so we have expressed 
the energy in terms of the coarse-grained fields / and 4> neglecting the internal energy of 

the fluctuations /4>. Therefore, the mass and the energy can be calculated at any time of 
the evolution from the coarse-grained field /. By contrast, the moments M,, with n>2 
will be called fragile integrals because they are altered on the coarse-grained scale since 
/" 7^ / (the local moments of the distribution are defined by /"(r,v,?) = / pr\'^dr]). 
Therefore, only the moments of the fine-grained field mI'^' = M„[/] = / f"drd\ are 



conserved, i.e. 

M/'=/p(r,v,,),Vr</v.,. (76) 

The moments of the coarse-grained field Mn^'[f] = J f drd\ are not conserved during 
the evolution since M„[/] 7^ In a sense, the moments M^'^' are "hidden con- 

straints" because they are expressed in terms of the fine-grained distribution p(r, v, T]) 
and they cannot be measured from the (observed) coarse-grained field. They can be 
only computed from the initial conditions before the system has mixed or from the fine- 
grained field. Since in many cases we do not know the initial conditions nor the fine- 
grained field, they often appear as "hidden". Note that instead of conserving the fine- 
grained moments, we can equivalently conserve the total hypervolume 7(77) = / pdrd\ 
of each level r\ . 

After a complex evolution, we may expect the system to be in the most probable, 
i.e. most mixed state, consistent with all the constraints imposed by the dynamics. This, 
however, relies on a hypothesis of ergodicity (efficient mixing) whose validity will be 
discussed in Sec. 6. We define the mixing entropy as the logarithm of the number of 
microscopic configurations associated with the same macroscopic state characterized by 
the probability density p(r, v, 7]). To get this number, we divide the macrocells (r, r + 
Jr;v,v + Jv) into v microcells of size h and denote by tiij the number of microcells 
occupied by the level r\j in the z-th macrocell. Note that a microcell can be occupied 
only by one level r\j. This is due to the fact that we make the statistical mechanics of a 
continuous field /(r, v,?) instead of point particles. Therefore, we cannot "compress" 
that field, unlike point particles. A simple combinatorial analysis indicates that the 
number of microstates associated with the macrostate {n/j} is 

W'(K}) = n^;!n7T' (77) 

/ 'No- 
where Nj — Y^i^ij is the total number of microcells occupied by r\j (this is a conserved 
quantity equivalent to y{T\))- We have to add the normalization condition = 
equivalent to Eq. (TtTI) . which prevents overlapping of different levels (we note that we 
treat here the level r] = on the same footing as the others). This constraint plays a 
role similar to the Pauli exclusion principle in quantum mechanics. Morphologically, 
the Lynden-Bell statistics (1771) corresponds to a 4* type of statistics since the particles 
are distinguishable but subject to an exclusion principle [35]. There is no such exclusion 
for the statistical equilibrium of point particles since they are free a priori to approach 
each other, so we can put several particles in the same microcell (see Sec. 2.4). 
Taking the logarithm of W and using the Stirling formula, we get 

\nW{{n,j]) = -Y^n.jXnmj = -'^vh^'^Pijlnpij, (78) 

iJ hi 

where pij = p(r/, v/, rjj) = njj/vh^. Passing to the continuum limit v ^ 0, we obtain 



the mixing entropy introduced by Lynden-Bell [1350 



SlbIp] = - / p(r,v,r7)lnp(r,v,T])JrJvJr]. 



(79) 



Note that the Lynden-Bell entropy can be interpreted as the Boltzmann entropy for a dis- 
tribution of levels T] (including 77 = 0). Equation (|79|) is sometimes called a collisionless 
entropy to emphasize the distinction with the collisional entropy (l34l) . Assuming ergod- 
icity or "efficient mixing", the statistical equilibrium state is obtained by maximizing 
the Lynden-Bell entropy Slb[p] while conserving the mass M, the energy E and all the 
Casimirs (or moments M,,). We need also to account for the local normalization condi- 
tion (ItTI) . This problem is treated by introducing Lagrange multipliers, so that the first 
order variations satisfy 



55LB-j85£-£a„5M„- j C(r,v)5 (^y'p(r,v, 77)^77^ JrJv = 0, 



(80) 



where j8 is the inverse temperature and a„ the "chemical potential" associated with Mn. 
The resulting optimal probability density is a Gibbs state which has the form 

p(r,v,77) = ^Z(^7K(^'+"^^ (81) 

2 

where e = y + ^(r) is the energy of a particle by unit of mass. In writing Eq. (fSlI) . we 

have distinguished the Lagrange multipliers a and /3 associated with the robust integrals 

M and E from the Lagrange multipliers a„>i, associated with the conservation of the 

fragile moments M„>i = / pr}"dr}drd\, which have been regrouped in the function 

XiTi) = exp( — X^„>i a„T]"). This distinction will make sense in the following. Under this 

form, we see that the equilibrium distribution of phase levels is a product of a universal 

Boltzmann factor by a non-universal function xil) depending on the initial 

conditions. The partition function Z is determined by the local normalization condition 

J pdr] — I leading to 

Z = j x{ri)e-'^^l^'+"Uri. (82) 

Finally, the equilibrium coarse-grained distribution function defined by / = / pridrj can 
be written 



or, equivalently. 



f=-^^=F{l5e + a)=f{e). (84) 



The function F is given by 



F(x) = -(lnz)'W, (85) 



where we have defined x{x) = f^l'"' xi'n)^^^'^^''!- It straightforward to check that a 
distribution function f = /(e) depending only on the energy £ is a stationary solution of 
the Vlasov equation [54]. Therefore, for a given initial condition, the statistical theory of 
Lynden-Bell selects a particular stationary solution of the Vlasov equation (most mixed) 



among all possible ones (an infinity!)0. Specifically, the equilibrium state is obtained by 
solving the integro-differential equation 



$(r) = J M(r-rO/«„,^[v'V2 + 4>(r')]^rW, 



(86) 



and relating the Lagrange multipliers j8 to the constraints M„, E. Then, we have 
to make sure that the distribution is a maximum of Sib, not a minimum or a saddle 
point. We note that the coarse-grained distribution function /(e) predicted by Lynden- 
Bell can take a wide diversity of forms depending on the function xiv) determined by 
the fragile moments ("hidden constraints") fixed by the initial condition. The coarse- 
grained distribution function (l83l) can be viewed as a sort of superstatistics as it is 
expressed as a superposition of Boltzmann factors (on the fine-grained scale) weighted 
by a non-universal function x{f]) iSS- Some examples will be given in the sequel. In 
the present context, the function x{^) is determined from the constraints a posteriori. 
Indeed, we have to solve the full problem in order to get the expression of xi^)- In 
this sense, the constraints associated with the conservation of the fine-grained moments 
are treated microcanonically H. We emphasize that the function /(e) determining the 
metaequilibrium state depends on the details of the initial conditions. This differs from 
the ordinary statistical equilibrium state (l36l) which only depends on the mass M and the 



energy E. Here, we need to know the value of the fine-grained moments M^{> j which are 
accessible only in the initial condition (or from the fine-grained field) since the observed 
moments are altered for ? > by the coarse-graining as the system undergoes a mixing 
process (M^'j^j 7^ ^f>i)- This makes the practical prediction of /(e) very complicated, 
or even impossible, since, in practice, we often do not know the initial conditions in 
detail (e.g., the initial state giving rise to a galaxy or to a large scale vortex). In addition, 
in many cases, we cannot be sure that the initial condition is not already mixed (coarse- 
grained). If it has a fine-grained structure, this would change a priori the prediction of 



the metaequilibrium state (see p. 284 of [250 and Il60n ). 

We note that the coarse-grained distribution function predicted by Lynden-Bell de- 
pends only on the individual energy e of the particles. According to the Jeans theorem 



[|54|]. such distribution functions form just a particular class of stationary solutions of 
the Vlasov equation. We also note that /(e) is a monotonically decreasing function of 
energy. Indeed, from Eqs. (|8TI) and (|84|) . it is easy to establish that 

/(e) = -/3/2, fi^j Pin -ffdr] > 0, (87) 



Incidentally, the fact that the coarse-grained distribution function should be a stationary solution of the 
Vlasov equation is not obvious. This depends on the definition of coarse-graining |580. 
^ In the context of two-dimensional turbulence, some authors 1I4TI l59ll have proposed that the vorticity 
distribution J (cj) could be imposed by a small-scale forcing, in which case it should be treated canonically 
(as a prior distribution). It is not clear whether there exists a similar interpretation for other systems with 
long-range interactions (note, however, that a small-scale forcing is present in certain models of dark 
matter in astrophysics). Here, we exclusively consider isolated systems where the distribution xi'H) is 
determined implicitly by the initial conditions. 



where f2 = f^ — f' is the centered local variance of the distribution p (r, v, 77 ) . Therefore, 
/ (e) < since /3 > is required to make the velocity profile normalizable. We also have 
F'{x) = —fii^) so that F is a monotonically decreasing function. Finally, the coarse- 
grained distribution function satisfies /(r, v) < /q"" where /q"^^ = /max(r,v,? = 0) is 
the maximum value of the initial (fine-grained) distribution function. This inequality 
can be obtained from Eq. (l83l) by taking the limit £ —00 for which /(e) Tjmax — 
j-max using the fact that f{e) is a decreasing function. Of course, the inequality 
< / < /q"^ is clear from physical considerations since the coarse-grained distribution 
function locally averages over the fine-grained levels. Since the fine-grained distribution 
function is conserved by the Vlasov equation, the coarse-grained distribution function is 
always intermediate between the minimum and the maximum values of /q. We also 



note 116 ih that the most probable level r]*(r, v) of the distribution (ISTI) is given by 
77* = [{\nxy]^^{li£ + a), provided that (ln;t;)"(?7*) < 0. The function T]*(r,v) is a 
stationary solution of the Vlasov equation of the form 77* = 77*(e) which monotonically 
decreases with the energy since rj^e) = P /{Inx)" {t}^) < 0. It usually differs from the 
average value /(r, v) of the distribution (ISTI) . 

If the initial DF takes only two values /o = and /o = rjo, the Lynden-Bell entropy 
becomes _ _ _ _ 

Slb = - /{ — ln^+( 1-^ )ln(l- — U^r^v, (88) 
J [rio rio V rio J \ rioJ } 

which is similar to the Fermi-Dirac entropy (but with, of course, a completely different 
interpretation). In that case, the statistical prediction of Lynden-Bell for the metaequi- 
librium state resulting from a "coUisionless" violent relaxation is 



^ l+eT!!()8e+«)' ^^^^ 

which is similar to the Fermi-Dirac distribution H. This has to be contrasted from 
the statistical equilibrium state resulting from a "coUisional" relaxation which is the 
Maxwell-Boltzmann distribution (|36l) . In the dilute limit of Lynden-Bell's theory / ^ 
770, the DF dHSl) becomes 

J = ^g-/377oe (90) 

This is similar to the statistical equilibrium state (l36l) . Therefore, in this approximation, 
coUisional and coUisionless relaxation lead to the same distribution function (the mean 
field Maxwell-Boltzmann distribution) but with a completely different interpretation, 
corresponding to very different timescales. To emphasize the difference, note in partic- 
ular the bar on / in Eq. (|90| ) and the fact that the mass of the individual stars m in Eq. 
(|36l) is replaced by the value r/o of the fine-grained distribution function in Eq. (|90l ). 

In conclusion, assuming ergodicity, the Lynden-Bell statistical equilibium state 
p (r, V, T] ) is obtained by solving the maximization problem 



max 
P 



{SlbIp] I E[p]=E,M[p]=M,Ml^,[p]=Mfj-^Jpdr^ = \}. (91) 



^ Degeneracy effects associated with the Lynden-Bell statistics (|89] l have been observed in 2D turbulence 
ll63ll . self-gravitating systems S and for the HMF model ll27ll . 



This corresponds to a condition of microcanonical stability. Let us introduce the Lynden- 
Bell free energy i^Lfi[p] = E[p] — TSlb[p] - The condition of canonical stability is 



mm 

P 



{Flb[p] I M[p]=M,Ml^,[p]=Ml^,Jpdn = l}. (92) 



The optimization problems (|9T1) and (|92l ) have the same critical points. Furthermore, if 
p (r, V, 77 ) solves the minimization problem (l92l) then it solves the maximization problem 
(|9T| ) and represents therefore a maximum entropy state. However, the reciprocal is wrong 
in case of ensemble inequivalence. A solution of (|9TI) is not necessarily a solution of 
(|92|) : the minimization problem (|92| ) just determines a subclass of the solutions of the 
maximization problem (|9TI) . Therefore, the condition of canonical stability (|92l) provides 
just a sufficient condition of microcanonical stability: (|92| ) ^ (|9T| ). 

Numerical algorithms: A relaxation equation has been proposed to solve the max- 
imization problem (|9T| ). It is obtained from a Maximum Entropy Production Principle 
(MEPP) and has the form of a generalized mean field Fokker- Planck equation [[ssl] . It can 
serve as a numerical algorithm to determine the Lynden-Bell maximum entropy state for 
given E, M and Mj^^'^ specified by the initial conditions. The relaxation equation solving 
the microcanonical problem (|9T|) is 



dt 
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with 



jD^-\drd\ 



(93) 



(94) 



jDf2V^drd\ ' 

This equation conserves the mass M, the energy E (through the time dependent Lagrange 
multiplier j8(?)) and the fine-grained moments and monotonically increases the 

Lynden-Bell entropy: Slb > (//-theorem). Furthermore, Slb = iff p is a steady solu- 
tion of Eq. (|93l) . Therefore, Slb[p] is the Lyapunov functional of the relaxation equation 
(1931) . It results that p (r, v, 77 ) is a linearly dynamically stable stationary solution of Eq. 
(1931 ) iff it is a (local) maximum of Sib at fixed E, M and . Minima or saddle points 
of entropy are dynamically unstable. Furthermore, if Slb is bounded from above (within 
the previous constraints) we can conclude for Lyapunov's theory that p (r, v, 7] , ?) will re- 
lax towards a (local) maximum of Slb at fixed E, M and for t +00 (if there exists 
several local entropy maxima, the selection will depend on a complicated notion of basin 
of attraction). Therefore, Eq. (|93l) with Eq. (|94l) can serve as a numerical algorithm to 
solve the microcanonical problem (|9TI) and compute the Lynden-Bell statistical equilib- 
rium state. It can also provide a phenomenological description of the out-of-equilibrium 
dynamics towards Lynden-Bell's statistical equilibrium pS)]. 

If we fix the inverse temperature /3 instead of the energy in Eq. (|931) , we get a 
relaxation equation solving the canonical problem (|92l ). This equation conserves the 
mass M and the fine-grained moments A/,f>j, and monotonically decreases the Lynden- 
Bell free energy: Flb < 0. Furthermore, Flb = iff p is a steady solution of (|93l ) with 
fixed /3. Therefore, Flb[p] is the Lyapunov functional of this relaxation equation. It 



results that p(r, v, r/) is a linearly dynamically stable stationary solution of Eq. (|93l) 
with fixed /3 iff it is a (local) minimum of Fib at fixed M and M^{^j . Furthermore, if 
Fib is bounded from below, p(r, v, 7],?) will relax towards a (local) minimum of Fib at 
fixed M and . Therefore, Eq. (|93] ) with fixed /3 can serve as a numerical algorithm 
to solve the canonical problem (|92l) and determine a subclass of Lynden-Bell statistical 
equilibrium states since ( |92| ) ^ ( |9T| ). 



5. GENERALIZED ENTROPIES IN THE REDUCED SPACE OF 
COARSE-GRAINED DISTRIBUTIONS 



Assuming ergodicity, we have seen that the most probable local distribution of phase 
levels p(r, V, r]) maximizes the Lynden-Bell mixing entropy (1791) while conserving 
mass, energy and all the fine-grained moments. This functional of p is the proper 
form of Boltzmann entropy in the context of the violent relaxation. It is obtained 
by a combinatorial analysis taking into account the specificities of the coUisionless 
(Vlasov) evolution. We shall first show that the equilibrium coarse-grained distribution 
function /(r, v) (which is the function directly accessible to the observations) extremizes 



a certain functional S[f] at fixed mass M and energy E [|65l . \i6\ . This functional is 
non-universal and depends on the initial conditions. It is determined indirectly by the 
statistical theory of Lynden-Bell and cannot be obtained from a combinatorial analysis, 
unlike ^[p] . Such functionals arise because they encapsulate the influence of fine-grained 
constraints (Casimirs) that are not accessible on the coarse-grained scale. They play the 
role of "hidden constraints". Now, if the distribution of phase levels x{'r]) is treated 
canonically, it can be shown that the equilibrium coarse-grained distribution function 
/(r, v) maximizes S[f] at fixed mass M and energy E. In this sense, S[f] will be called a 
"generalized entropy" in the reduced space of coarse-grained distributions 0. A coarse- 
grained DF that maximizes a "generalized entropy" S[f] at fixed mass and energy is a 
Lynden-Bell statistical equilibrium state (but the reciprocal is wrong in case of ensemble 
inequivalence). We note that the entropic functionals S[p] and S[f] are defined on two 
different spaces. The p-space is the relevant one to make the statistical mechanics of 



violent relaxation [13511 . The /-space is a sort of projection of the p-space in the space of 
directly observable (coarse-grained) distributions. 

Since the coarse-grained distribution function /(g) predicted by the statistical theory 
of Lynden-Bell depends only on the individual energy and is monotonically decreasing 



(see Sec. 4), it extremizes a functional of the form [|65l.l3 



S[f] = - / C(/) JrJv, (95) 



^ We emphasize that this notion of "generaUzed entropies" ll65l[36ll is completely different from the notion 
of generalized entropies introduced by Tsallis (see Sec. 7). These entropies take into account the influence 
of the Casimir constraints for an ergodic evolution while the Tsallis entropies aim at describing non- 
ergodic behaviours. 



at fixed mass M and energy E, where C(/) is a convex function, i.e. C" > 0. Indeed, 
introducing Lagrange multipliers and writing the variational principle as 

8S-pdE-a5M = 0, (96) 

we find that 

C'(7) = -/3e-a. (97) 

Since C' is a monotonically increasing function of /, we can inverse this relation to 
obtain 

7 = F(/3e + «)=/(£), (98) 

where 

F{x) = {C')-\~x). (99) 

From the identity 

/(£) = -/3/C''(7), (100) 

resulting from Eq. (l97l) . /(e) is a monotonically decreasing function of energy (if /3 > 0). 
Thus, Eq. dSll) is compatible with Eq. ^ provided that we identify ^ with (l85l) . 
Therefore, to any function F(jc) determined by the function ;f(T]) in the statistical theory 
according to Eq. (l85l) . we can associate to the metaequilibrium state (|84l) a functional 
(|95l) where C(/) is given by Eq. (l99l) or equivalently by 

C(J) = - F-\x)dx. (101) 
Using Eq. (f85l) . this function is explicitly given in terms of ^ by 

C{J) = - j\{\nx)']-\-x)dx. (102) 

On the other hand, comparing Eqs. (|87l) and (llOOl) . we find that 

/2 = 1/C"(7). (103) 

This equation relates the variance of the equilibrium distribution /2 to the coarse- 
grained distribution function / through the function C. We note that C(/) is a non- 
universal function which depends on the initial conditions. Indeed, it is determined by 
the function x{t]) which depends indirectly on the initial conditions through the com- 
plicated procedures discussed in Sec. 4. In general, S[f] is not the Boltzmann functional 
= — / flnfdrdy (except in the dilute limit of the Lynden-Bell theory) due to fine- 
grained constraints (Casimirs) that modify the form of entropy that we would naively 
expect. This is why the quasi-stationary state is described by non-standard distributions 
(even for an assumed ergodic evolution). The existence of "hidden constraints" (here 
the Casimir invariants that are not accessible on the coarse-grained scale) is the physi- 
cal reason for the occurrence of non-standard distributions and "generalized entropies" 
in our problem. In fact, the distribution is standard (Boltzmann) at the level of the lo- 
cal distribution of fluctuations p (r, v, r] ) (p -space ) and non-standard at the level of the 
macroscopic coarse-grained field f{r,y) (f -space). 



So far, we have shown that the coarse-grained DF extremizes the functional (|95] )- 
(11021 ) at fixed mass and energy. The conditions under which it maximizes this functional 



at fixed mass and energy have been discussed recently by Bouchet [|66n . In the following, 



we give a complementary discussion, adopting a presentation similar to the one used in 



[|42n in a different context (see also Sec. 11). We introduce the grand entropy [13611 



SM = Slb[p] - £ an Mf-^-[p], (104) 

n>l 

which is the Legendre transform of Slb[p] with respect to the fragile moments M^^'^ [p] . 
Expliciting these fine-grained moments, it can be rewritten 



drd\dr\, (105) 



where we have noted 



Z(77)=exp|-£a,r]"|. (106) 
I «>i ) 



We now consider the grand microcanonical ensemble defined by (Sf 

{S^[p] I E[p]=E,M[p]=M,Jpdr) = l}. (107) 



max 
P 



If p(r,v, 77) solves the maximization problem (11071) . then it solves the maximization 
problem (|9T| ) and represents therefore a Lynden-Bell maximum entropy state. Note that 
the reciprocal is wrong in case of ensemble inequivalence [[ill a solution of ([9T1) 
is not necessarily a solution of ([107[) . In other words, the maximization problem ([107[) 
determines only a sublclass of solutions of the maximization problem ([9T1) . Therefore, it 
only provides a sufficient condition of Lynden-Bell thermodynamical stability. Note that 
we can also consider the grand canonical ensemble 



mm {Fx[p]=E-TSx[p] \ M[p] = M, J pdr] = I}, (108) 

where F^[p] is a grand free energy. We have the implications: ([108[) =^ ([107[) =^ (|9TI) 
and ([108[) =^ dg!]) =^ (19B so that ([108[) gives a sufficient condition of Lynden-Bell ther- 
modynamical stability (less refined than ([107[) ). In the following, we shall consider the 
maximization problem ([107[) . To determine the distribution (r, v, rj ) which maximizes 
S),[p] with the robust constraints E[f] = E, M[f] = M, and the normalization condi- 
tion J p drj = I, we can proceed in two steps (we consider here the global maximiza- 
tion problem. The local maximization problem is treated in the Appendix). First step: 
we determine the distribution pi(r,v, T]) which maximizes S^[p] with the constraint 
J pdrj = 1 and a fixed coarse-grained distribution J prjdrj = /(r, v) (note that fixing 
the coarse-grained distribution function automatically determines the robust constraints 
E and M). This gives a distribution pi[/(r, v), 77] depending on /(r,v) and rj. Substi- 
tuting this distribution in the functional 5';^[p], we obtain a functional S[f] = S^[pi] of 
the coarse-grained distribution /. Second step: we determine the distribution function 



/^(r, v) which maximizes S[f] with the constraints £ [/] = E and M[f] = M. Finally, we 
have p*(r,v, 77) = pi[/^(r,v),77]. Let us now show that S[f] = S^lpi] is the functional 
(11021) . The distribution pi(r,v, 7]) that extremizes 5;^[p] with the constraints J pdrj = I 
and / prj drj = /(r, v) satisfies the first order variations 

dS^- J ^{r,\)5(^J pr]dr]^drdy- j C{r,\)5(^J pdri^drd\ = 0, (109) 

where ^•(r,v) and C(r,v) are Lagrange multipliers. This yields 

pi(r,v,77) = -^z(^)e"''*^'"'^\ (110) 
Z(r,v) 

where Z(r,v) and $(r,v) are determined by 

Z(r,v) =1 x{l)e-'''^^'^'^dr^=x{^), (111) 

7(r,v) = / xWle-^'^^'^-'UT^ = -{InxYm. (112) 

The critical point (II 101) is a maximum of 5;^[p] with the above-mentioned constraints 
since 5^5;^ = — /[(5p)^/p]JrJvJ77 < 0. Then, we compute 



Sx[pA = - J pi{-ri^-^^X)drd\dri = J {f^ + \nx{^))drdy. 



(113) 



Therefore, the functional of the coarse-grained DF is 

S[f] = -jc(J)drdy, (114) 

with 

C(f) = -J^-\nxm. (115) 
Now 4>(r, v) is related to /(r,v) by Eq. (|112|) . This implies that 

C'{f) = -^ = -[{\nx)r\-l). (116) 

so that 

C(f) = - j\{\nx)'r\-^)dx. (117) 

This returns the functional (11021 ). We now conclude that p (r, v, r\ ) solves the maximiza- 
tion problem (11071) if, and only if, /(r, v) solves the maximization problem 

m_ax {S[f\ I E[f\=E,M[J\^M}, (118) 

/ 



where S[f] is given by Eqs. (II 141) and (II 171) . The optimal distributions are related to 
each other by p*(r,v, 7]) = pi[/^(r,v),77] according to the procedure described previ- 
ously. We have the implications: (11181) (11071) =^ (|9T1) . Therefore, if /(r,v) solves 
the maximization problem (II 181) . the corresponding distribution p(r, v, r/) solves the 
maximization problem (|9T| ) and is therefore a Lynden-Bell statistical equilibrium state. 
However, the reciprocal is wrong in case of ensemble inequivalence. The maximum 
entropy state satisfying (|9T1) may not satisfy (11071) or (II 181) . Therefore, although the 
coarse-grained DF always extremizes S[f\ at fixed E, M, it is not a maximum of S[f] at 
fixed M in the region of ensemble inequivalence. These maximization problems (1107 1 
or (II 181) provide only sufficient conditions of Lynden-Bell thermodynamical stability 1^. 
Returning to the maximization problem (II 181) . we introduce the generalized free energy 
F[f] = E [/] — TS[f] . Now, if /(r, v) solves the minimization problem 

min {F[f] I M[f\=M}, (119) 
/ 

then it solves the maximization problem (II 181) . However, the reciprocal is wrong in 
case of ensemble inequivalence: the solutions of (II 181) are not necessarily solutions of 
(|1 191) . Therefore, the generalized canonical problem (II 191) determines only a sublclass 
of solutions of the generalized microcanonical problem (II 181) . We have the implications: 
(ITT91) ^ (fTTSl) ^ (flOTl) ^ (ED and (fTT9l) ^ (flOSl) ^ (fTOTl) ^ (EB • 

Numerical algorithms: The numerical algorithms solving the optimization problems 
(|1 181) and (II 191) will be presented in Sec. 10 in connection with the formal nonlinear 
dynamical stability with respect to the Vlasov equation. 



6. INCOMPLETE VIOLENT RELAXATION 

The statistical approach presented previously rests on the assumption that the collision- 
less mixing is efficient so that the ergodic hypothesis which sustains the statistical theory 
of Lynden-Bell is fulfilled. There are situations where the Lynden-Bell prediction works 
relatively well. However, there are other situations where the Lynden-Bell prediction 



fails. It has been understood since the beginning [|35l] that violent relaxation may be in 



complete in certain cases so that the Lynden-Bell mixing entropy is not maximized in 



the whole available phase space. Incomplete relaxation [|67l] can lead to more or less 



severe deviations from the Lynden-Bell prediction. Physically, the system tends to reach 



^ Note that the maximization problem ( II 18l l determines a state that is both Lynden-Bell thermodynam- 
ically stable with respect to fine grained perturbations 5p(r,v,77) (according to criterion (l9Tl l <= ( 1118! )) 
and formally nonlinearly dynamically Vlasov stable with respect to coarse-grained perturbations 5/(r, v) 
(according to criterion ( 1136b ). On the other hand, since (jOTj) does not imply ( 1118! ) in case of ensemble in- 
equivalence, we conclude that there may exist stable Lynden-Bell statistical equilibrium states that do not 
satisfy the criterion of formal nonlinear dynamical stability ( !736D . Therefore, either; (i) the ensembles are 
always equivalent with respect to the conjugate variables j , 0C„> i ) (ii) Lynden-Bell thermodynam- 
ical stability does not imply formal nonlinear dynamical stability (iii) there exists a stronger condition 
of nonlinear dynamical stability than (!136! ). This intriguing point needs further development. In particu- 
lar, it would be interesting to exhibit a situation of ensemble inequivalence with respect to the conjugate 
variables (M^-^fj , a„>i). 



the Lynden-Bell maximum entropy state during violent relaxation but, in some cases, it 
cannot attain it because the variations of the potential, that are the engine of the evo- 
lution, die away before the relaxation process is complete (there may be other reasons 
for incomplete relaxation). Since the Vlasov equation admits an infinite number of sta- 
tionary solutions, the coarse-grained distribution /(r, v, t) can be trapped in one of them 
and remain frozen in that quasi stationary state /g55(r,v) until coUisional effects fi- 
nally come into play (on longer timescales). This steady solution is not always the most 
mixed state (it can be only partially mixed) so it may differ from Lynden-Bell's statis- 
tical prediction. Thus, for dynamical reasons, the system does not always explore the 
whole phase space ergodically. In general, the statistical theory of Lynden-Bell gives a 
relatively good first order prediction of the QSS without fitting parameter and is able to 
explain out-of-equilibrium phase transitions between different types of structures, de- 
pending on the values of the control parameters fixed by the initial condition. However, 
there are cases where the prediction does not work well (it can sometimes be very bad) 
because of incomplete relaxation. Other stationary solutions of the Vlasov equation, dif- 
fering from Lynden-Bell's prediction, can arise in case of incomplete violent relaxation. 
The difficulty is that we do not know a priori whether the prediction of Lynden-Bell will 
work or fail because this depends on the dynamics and it is difficult to know in advance 
if the system will mix well or not. Let us give some examples of complete and incom- 
plete violent relaxation in stellar systems, 2D turbulence and for the HMF model (for 
more discussion, see [|67l.l4l]'). 

Stellar systems: The concept of violent relaxation was first introduced by Lynden- 
Bell 113 511 to explain the apparent regularity of elliptical galaxies in astrophysics. How- 
ever, for 3D stellar systems the prediction of Lynden-Bell leads to density profiles whose 
mass is infinite (the density decreases as r^^ at large distances). In other words, there is 
no maximum entropy state at fixed mass and energy in an unbounded domain. Further- 
more, it is known that the distribution functions (DP) of galaxies do not only depend on 
the energy e = v^/2-|-4»(r) contrary to what is predicted by the Lynden-Bell statisti- 
cal theory. This means that other ingredients are necessary to understand their structure. 
However, the approach of Lynden-Bell is able to explain why elliptical galaxies have an 
almost isothermal core. Indeed, it is able to justify a Boltzmannian distribution / ~ 
in the core without recourse to collisions which operate on a much longer timescale 
tR ~ {N /lnN)ti). However, violent relaxation is incomplete in the halo. The concept of 
incomplete violent relaxation explains why galaxies are more confined than predicted by 
statistical mechanics (the density profile of elliptical galaxies decreases as r^^ instead 
ofr-2 IH). 

For one dimensional self-gravitating systems, the Lynden-Bell entropy has a global 
maximum at fixed mass and energy in an unbounded domain. Early simulations of 
the ID Vlasov-Poisson system starting from a water-bag initial condition have shown 
a relatively good agreement with the Lynden-Bell prediction In other cases, the 
Vlasov equation (and the corresponding A^-body system) can have a very complicated, 
non-ergodic, dynamics. For example, starting from an annulus in phase space, Mineau 
et al. 16811 have observed the formation of phase-space holes which block the relaxation 
towards the Lynden-Bell distribution. In that case, the system does not even relax 
towards a stationary state of the Vlasov equation but develops everlasting oscillations. 



Two-dimensional vortices: In the context of two-dimensional turbulence, Miller 



[|69ll and Robert & Sommeria []7(J] have developed a statistical mechanics of the 2D 
Euler equation which is similar to the Lynden-Bell theory (see [13 811 for a description 
of this analogy). This theory works relatively well to describe vortex merging |71] or 
the nonlinear development of the Kelvin-Helmholtz instability in a shear layer ll63n . It 
can account for the numerous bifurcations observed between different types of vortices 



(monopoles, dipoles, tripoles,...) ||25|, 1720 and is able to reproduce the structure of 



geophysical and jovian vortices like Jupiter's great red spot [|73L l74l . 

However, some cases of incomplete relaxation have been reported. For example, in 
the plasma experiment of Huang & DriscoU 17511 . the Miller-Robert-Sommeria (MRS) 
statistical theo ry g ives a reasonable prediction of the QSS without fit but the agreement 
is not perfect nl6\ . The observed central density is larger than predicted by theory and 
the tail decreases more rapidly than predicted by theory, i.e. the vortex is more confined. 
This is related to the fact that mixing is not very efficient in the core and in the tail of 
the distribution (this can be explained by invoking the concept of "maximum entropy 
bubbles" to delimitate the region where the vorticity mixes well lf72ll or by developing a 
kinetic theory of violent relaxation, see fsol l and Sec. 8). As noted by Boghosian [771, in 
this case the QSS can be fitted by a Tsallis distribution where the density drops to zero 
at a finite distance. 

The HMF model: Recently, Antoniazzi et al. [27] have performed numerical simula- 
tions of the HMF model, starting from a water bag initial condition, to test the prediction 
of the Lynden-Bell theory. For a given value of the energy U = 0.69, this theory predicts 
an out-of-equilibrium phase transition between a spatially homogeneous phase and a 
spatially inhomogeneous phase above a critical magnetization Merit = 0.897 discovered 
in [2^, 27I1. Numerical simulations show a relatively good agreement with the Lynden- 
Bell theory when the initial magnetization Mq < Merit- However, for Mq > Merit several 
authors report important deviations to the Lynden-Bell theory. In particular, for Mq = 1, 
Latora et al. [|32ll find that the QSS is spatially homogeneous (Mqss = 0) and the velocity 
distribution is non-gaussian, while the Lynden-Bell theory predicts a spatially inhomo- 
geneous state with gaussian tails (the same as the Boltzmann equilibrium state, since 
we are in the non-degenerate limit i4j\). Using isotropic water-bag initial conditions, 
Campa et al. JItIi show that the QSS is well-fitted by a semi-elliptical distribution. As 
remarked in 14], this corresponds to a Tsallis distribution with a compact support (with 
index q = 3) so that the distribution function drops to zero above a finite velocity. This 
confinement is a typical result of incomplete relaxation and can be explained by a kinetic 
theory of violent relaxation (see ^ and Sec. 8). 

On the other hand, using different initial conditions, Morita & Kaneko [[7 811 find that 
the system does not relax to a QSS but exhibits oscillations whose duration diverges 
with A^^. Therefore, the Lynden-Bell prediction clearly fails. This long-lasting periodic 
or quasi periodic collective motion appears through Hopf bifurcation and is due to the 
presence of clumps (high density regions) in phase space. We remark that this behaviour 
is relatively similar to the one reported by Mineau et al. [|68ll for self-gravitating systems, 
except that they observe phase space holes instead of phase space clumps. 



7. GENERALIZED THERMODYNAMICS: THE TSALLIS 

ENTROPY 



To account for the non-Boltzmannian nature of the QSS, some authors [TtLSjIt^ have 
proposed to change the form of the entropy. In particular, it has been proposed that the 
QSS could be in the "most probable" state that maximizes the Tsallis entropy 



Sq[f] = J (/^(r,v) -/(r,v))JrJv, 



(120) 



at fixed mass and energy. This leads to a distribution function of the form /^(r, v) = 
[jl — I5{q— l)e/t7]^/(^^^) (where /i and /3 are Lagrange multipliers determined by the 
mass and the energy) which coincides with the Boltzmann distribution for = 1 and 
differs from it when q ^ I. Therefore, the Tsallis entropy (11201) can be viewed as a 



generalization of the Boltzmann entropy Il37h . However, it was remarked by Brands et 



al. fldi\ that the entropy Sq[f] can lead to inconsistencies when applied to the context 
of the violent relaxation (in order to describe the QSS) where the system's dynamics is 
governed by the Vlasov equation. Indeed, the Tsallis distribution fq (r, v) may not respect 
the properties of the Vlasov equation, in particular the fact that the DF is bounded by the 
maximum value of the initial condition: /(r,v) < /,„ax(r, v,? = 0). In our point of view 
[7^ [3^, the proper form of Tsallis entropy in the context of the violent relaxation is 



Sq[p] = ^ / (p'^(r,v,T]) -p(r,v,r]))JrJvJT]. (121) 

This is a functional of the probability density p(r,v, 77) which takes into account the 
specificities of the coUisionless (Vlasov) dynamics. For ^ ^ 1, it returns the Lynden- 
Bell entropy (|79l ). For q ^ I, it could take into account incomplete mixing and non- 
ergodicity. In that context, the q parameter could be interpreted as a measure of mixing 
(^ = 1 if the system mixes efficiently) and Tsallis entropy (11211) could be interpreted as 
a functional attempting to take into account non-ergodicity in the process of incomplete 
violent relaxation □. Maximizing Sq[p] at fixed mass, energy and Casimirs leads to a q- 
generalization of the equilibrium state (|8TI) . This maximization principle is a condition 
of thermodynamical stability (in Tsallis' generalized sense) in the context of violent 
relaxation. Then, we can obtain a ^-generalization of the equilibrium coarse-grained 
distribution function (|84|) . In the case of two levels / G {0, T]o}, and in the dilute limit 
of the theory / ^ T]o, Sq[p] can be written in terms of the coarse-grained distribution 



^ A striking property of the Tsallis entropy is to yield distribution functions that have a compact support 
(for q> 1) so that the density vanishes above a maximum energy. This leads to a confinement in phase 
space that is qualitatively (and sometimes quantitatively) similar to what is observed in situations of 
incomplete relaxation (see discussion in Sec. 6). Indeed, since mixing is never complete, the high energy 
tail of the distribution is less populated than what is predicted by the Boltzmann statistical mechanics 
(note that this notion of confinement is similar to the notion of maximum entropy "bubble" introduced in 
II72II ). However, as discussed in ll67ll . it is not clear why this confinement should necessarily (universally) 
be described by a Tsallis ^'-distribution. Other distributions with a compact support could also result from 
an incomplete relaxation. 



/ = pr/o in the form 

^^[^ = / [(^/^o)' - (7/^o)]^rJv, (122) 

similar to Eq. (11201 ). but with a different interpretation than the one given in I TtLIbH ItqIi. 



However, as stated in 17611 . it is not clear why complicated effects of non-ergodicity 
(incomplete mixing) could be encapsulated in a simple functional such as (I121|) . Other 
functionals of the form S = — J C{p)drd\dr} (where C is a convex function) could be 
considered as well 0. Tsallis generalized entropy Sq and Tsallis q distributions can 
probably describe a certain type of non-ergodic behaviour but not all of them: they 
are not "universal attractors" of the (coarse-grained) Vlasov equation. For example, 



elliptical galaxies are not stellar polytropes [15411 so their DF cannot be fitted by the 



Tsallis distribution 16711 ■ An improved model is a composite model that is isothermal 



in the core (justified by Lynden-Bell's theory of violent relaxation) and polytropic in 
the halo (due to incomplete relaxation) with an index n = A corresponding to ^ = 7/5 



[|80LI12|1 . This observation suggests that the achieved distributions could be described by 
entropies of the form (|121l) where q = q(r, v) depends on the position (in phase space) 
so as to take into account the degree of mixing UM- Unfortunately, this idea is not very 
useful in practice if we do not give an a priori prescription to relate the value of the 
index ^(r, v) to the phase-space region. Finally, we note that the Tsallis distribution with 
a time dependent index q{t) can provide a good fit of the distribution of the system 
during its slow coUisional evolution towards the Boltzmann distribution {q = 1). This 
idea has been developed by Taruya & Sakagami llsill in stellar dynamics and proposed 
by Chavanis ^ for the HMF model. 



8. KINETIC THEORY OF VIOLENT RELAXATION 



An alternative idea to understand the problem of incomplete violent relaxation is to 
develop a kinetic theory of the Vlasov equation on the coarse-grained scale in or- 
der to understand dynamically what limits the convergence towards the Lynden-Bell 
distribution 113 811 . Kinetic theories of colli sionless relaxation have been developed in 

with the aim to determine the dynamical equation satisfied by 
the coarse-grained DF f{v,yj). An interesting approach is based on a quasilinear the- 



ory of the Vlasov equation [|52l. 1821 13911. In the two levels approximation, the quasilinear 



theory leads to a kinetic equation for the coarse-grained DF of the form 



df df -df 
ot or ov 



rt r pH 

/ dt drid\i — (1 ^0)G(/',f-T) 
Jo J m 



X — (1^0)<i/i(77o 
m 



(123) 



^ We stress, however, that the Tsalhs entropy is the simplest generalization of the Boltzmann entropy 
because, on an axiomatic point of view, the Tsallis entropy possesses almost all the properties of the 
Boltzmann entropy. This is not the case for other types of "generalized entropies". 



where £,-, are the coarse-graining mesh sizes in position and velocity spaces, 4> is the 
smooth field produced by the coarse-grained DF and / = /(r, v,f), /i = /(rj, vi,?). This 
equation is expected to describe the late quiescent stages of the relaxation process when 
the fluctuations have weaken so that the quasilinear approximation can be implemented. 
It does not describe the early, very chaotic, process of violent relaxation driven by the 
strong fluctuations of the potential. The quasilinear theory of the Vlasov equation is 
therefore a theory of "quiescent" coUisionless relaxation. 

Equation (11231) is very similar, in structure, to Eq. (l53l) for the coUisional evolution, 
with nevertheless three important differences: (i) the fluctuating force ^(1 — > 0) is 
replaced by the direct force F(l — 0) because the fluctuations have a different nature in 
the two problems, (ii) The distribution function / in the coUisional term of Eq. (l53l) 
is replaced by the product /(t]o ~ /) in Eq. (11231) . This nonlinear term arises from 
the effective "exclusion principle", discovered by Lynden-Bell, accounting for the non- 
overlapping of phase levels in the coUisionless regime. This is consistent with the Fermi- 
Dirac-like entropy (l88l) and Fermi-Dirac-like distribution (f89l) at statistical equilibrium 
(iii) Considering the dilute limit / ^ 770 to fix the ideas, we see that the equations (11231) 
and (1531 ) have the same mathematical form differing only in the prefactors: the mass m 
of a particle in Eq. (l53l) is replaced by the mass rjocfe^ of a completely filled macrocell 
in Eq. (11231 ). This implies that the timescales of coUisional and coUisionless "relaxation" 
are in the ratio 

^ (124) 

Since rjocfe^ ^ m, this ratio is in general quite small implying that the coUisionless 
relaxation is much more rapid than the coUisional relaxation. Typically, tncoli is of the 
order of a few dynamical times to (its precise value depends on the size of the mesh) 
while tcoii is of order ~ Nt^ or larger. The kinetic equation (11231) conserves the mass and, 
presumably, the energy. By contrast, we cannot prove an H-theorem for the Lynden-Bell 
entropy (l88l) . Indeed, the time variation of the Lynden-Bell entropy is of the form 



SLB = l:£fer I drdydridyi=- =-^- ^ f dTQ{t)G{t,t ^ T)Q{t - t), (125) 

2 J /(t?o-/)/i(t?o-/i)-/o 



(2(0 = —(1^0,0 

m 



fi(vo-fi)-^-f{no 



(126) 



and its sign is not necessarily positive. This depends on the importance of memory 
effects. Now, even if Eq. (11231) conserves the energy and the mass and increases the 
Lynden-Bell entropy (l88l) monotonically, this does not necessarily imply that the system 
will converge towards the Lynden-Bell distribution (f89l) . Indeed, it has been observed in 
several experiments and numerical simulations that the QSS does not exactly coincide 
with the strict statistical equilibrium state predicted by Lynden-Bell because of the 
complicated problem of incomplete relaxation (see Sec. 6). This is usually explained 
by a lack of ergodicity or "incomplete mixing". Here, we try to be a little more precise 
by using the kinetic theory. There can be several reasons of incomplete relaxation: 



(i) Absence of resonances: Very few is known concerning kinetic equations of the 
form of Eq. (11231) and it is not clear if the Lynden-Bell distribution (|89l) is a stationary 
solution of that equation (and if it is the only one). As explained in Sec. 3.4 for the kinetic 
equation (l53l) describing the coUisional relaxation, the relaxation may stop because 
the current J vanishes due to the absence of resonances. This argument may also 
apply to Eq. (11231) which has a similar structure and can be a cause for incomplete 
relaxation. The system tries to approach the statistical equilibrium state (as indicated 
by the increase of the entropy) but may be trapped in a QSS that is different from the 
statistical prediction (f89l) . This QSS is a steady solution of Eq. (11231) which cancels 
individually the advective term (l.h.s.) and the effective collision term (r.h.s.). This 
determines a subclass of steady states of the Vlasov equation (cancellation of the l.h.s.) 
such that the complicated "turbulent" current J in the r.h.s. vanishes. This offers a 
large class of possible steady state solutions that can explain the deviation between the 
QSS and the Lynden-Bell statistical equilibrium state (f89l) observed, in certain cases, in 
simulations and experiments of violent relaxation. 

(ii) Incomplete relaxation in phase space: The turbulent current J in Eq. (11231) is 

driven by the fluctuations /2 = = /2 — of the distribution function generating 
the fluctuations of the potential (see Jsj] for more details). In the "mixing region" 
of phase space where the fluctuations are strong, the DF tends to reach the Lynden- 
Bell distribution (l89l) . As we depart from the "mixing region", the fluctuations decay 
ifi ~^ 0) and the mixing is less and less efficient || J|| 0. In these regions, the system 
takes a long time to reach the Lynden-Bell distribution (|89l) and, in practice, cannot 
attain it in the time available (see (iii)). In the two levels case, we have /2 = /(t]o ~ /)• 
Therefore, the phase space regions where / — > or / ^ r/o do not mix well (the diffusion 
current J is weak) and the observed DF can be sensibly different from the Lynden-Bell 
distribution in these regions of phase space. This concerns essentially the core (/ rjo) 
and the tail (/ 0) of the distribution. Note that the confinement of the tail (/ 0) is 
consistent with the notion of maximum entropy "bubble" [72]. However, mixing can also 
be incomplete inside the "bubble", in particular in the core (/ rjo) of the distribution. 

(iii) Incomplete relaxation in time: during violent relaxation, the system tends to 
approach the statistical equilibrium state (f89l) . However, as it approaches equilibrium, 
the fluctuations of the potential, which are the engine of the evolution, become less and 
less effective to drive the relaxation. This is because the scale of the fluctuations becomes 
smaller and smaller as time goes on. This effect can be taken into account in the kinetic 
theory by considering that the correlation lengths £^(f) and £,,(?) decrease with time so 
that, in the kinetic equation (11231) . the prefactor £^(?)£v(?) — for t +0°. As a result, 
the "turbulent" current J in Eq. (11231) can vanish before the system has reached the 
statistical equilibrium state (f89l) . In that case, the system can be trapped in a QSS that is 
a steady solution of the Vlasov equation different from the statistical prediction (|89| ). 

Similar arguments have been given in [|83L 1381] on the basis of a more phenomeno- 
logical kinetic theory of violent relaxation based on the MEPP. The idea is to keep the 
Lynden-Bell entropy (l79l) unchanged (as being the fundamental entropy for the process 
of violent relaxation) but describe the dynamical evolution of p (r, v, r] , ?) by a relaxation 



equation of the form 
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dt 
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with a diffusion coefficient D(r,v,?) going to zero for large time (as the variations of 
the potential 4> decay) and in regions of phase-space where the fluctuations 54> are 
not strong enough to provide efficient mixing (so that /2 — * 0). The vanishing of the 
diffusion coefficient can "freeze" the system in a subdomain of phase space and account 
for incomplete relaxation and non-ergodicity. In general, the resulting state, although 
incompletely mixed, is not a Tsallis ^-distribution. This approach in interesting because 
it is not based on a generalized entropy, so there is no free parameter like q. However, 
it demands to solve a dynamical equation (11271) to predict the QSS. The idea is that, in 
case of incomplete relaxation (non-ergodicity), the prediction of the QSS is impossible 
without considering the dynamics [[sF 



9. GENERALIZED //-FUNCTIONS IN /-SPACE 



In order to quantify the importance of mixing, Tremaine et al. mm have introduced 
the notion of generalized //-functions. A generalized //-function is a functional of the 
coarse-grained DF of the form 

^[7]=- jc(J)drdy, (128) 

where C is any convex function {C" > 0). We assume that the initial condition at ? = has 
been prepared without small-scale structure so that the fine-grained and coarse-grained 
DF are equal: /(r, v,0) = /(r,v,0). For t > 0, the system will mix in a complicated 
manner and develop intermingled filaments so that these two fields will not be equal 
anymore. We have 

H{t) -H{0) = J {C[7(r,v,0)] -C[7(r,v,r)]}JrJv 

= J {C[/(r,v,0)] -C[7(r,v,r)]}JrJv. (129) 

The fine-grained DF is solution of the Vlasov equation 

^+v|^_V$|^ = 0. (130) 
dt dr dy 



Thus 



d 
dt 



J C{f)drdy = J C'if) ^drdy = -J C'{f) {y^ - V4.|^^ drdy 



This shows that the functional H[f] calculated with the fine-grained DF is independent 
on time (it is a particular Casimir) so Eq. (11291) becomes 



H{t) -H{0) = J {C[/(r,v,r)] -C[f{r,yj)]}drdy. (132) 

Let us now divide a macrocell of surface A into V microcells. We call fi the value of the 
DF in a microcell. The contribution of a macrocell to H{t) — H{0) is 

a{^Icu)-c(^ £/;■)}, (133) 

which is positive since C is convex. This implies that H{t) —H{0) > 0. We conclude that 
the generalized //-functions //[/] calculated with the coarse-grained DF increase in the 
sense that H{t) > H{0) for any t >0 This is similar to the Boltzmann //-theorem in 
kinetic theory. However, contrary to the Boltzmann equation, the Vlasov equation does 
not single out a unique functional (the above inequality is true for all //-functions) and 
the time evolution of the //-functions is not necessarily monotonic (nothing is implied 
concerning the relative values of //(?) and H{t') for tj' > 0). 

Exemple: the Tsallis functional expressed in terms of the coarse-grained DF 



is a particular generalized //-function. This interpretation of Tsallis functional as a 
generalized //-function in the sense of was given by Plastino & Plastino ifsill and 
Chavanis [65, 3^, 12]. The Boltzmann functional H\ [/] = — / flnfdrdy, corresponding 



to ^ = 1 is also a particular //-function in the sense of [|4! 



10. FORMAL NONLINEAR DYNAMICAL STABILITY FOR THE 

VLASOV EQUATION 

In this section, we discuss the formal nonlinear dynamical stability of stationary solu- 
tions of the Vlasov equation. We consider a special class of Casimir functionals of the 
form 

S[f] = - j C{f)drdy, (135) 

where C is any convex function (C" > 0). These functionals, as well as the energy and 
the mass, are conserved by the Vlasov equation. It results that the maximization problem 

max {S[f] I E[f]=E,M[f]=M} (136) 

determines a steady state of the Vlasov equation that is formally nonlinearly dynami- 
cally stable. Writing the first variations as 5S — fiSE — a5M = 0, the steady state satis- 
fies / = F(/3£ + a) where F{x) = {C')^^ {—x). Therefore, this distribution function is a 
monotonically decreasing function of the energy (for /3 > 0): / = /(e) with /'(e) < 0. 



Formally, the maximization problem (11361) is similar to a condition of "microcanonical 
stability" in thermodynamics where S plays the role of an entropy. Due to this thermody- 



namical analogy [16511 . we can use the methods of thermodynamics to study the nonlinear 
dynamical stability problem (11361) . We also note that if /(r, v) solves the minimization 
problem 

mm{F[/] =E[f]- TS[f] \M[f] = M} , (137) 

then it solves the maximization problem (11361) and is therefore nonlinearly dynamically 
stable. However, the reciprocal is wrong in case of ensemble inequivalence. A solution 
of (11361) is not necessarily a solution of (11371) . Therefore, the minimization problem 
(11371) provides just a sufficient condition of nonlinear dynamical stability. The stability 
criterion (11361) is more refined than the stability criterion (11371) : if a distribution / = 
/(e) with /'(e) < satisfies (11361) or (11371) . then it is nonlinearly dynamically stable; 
however, if is does not satisfy (11371 ) it can be nonlinearly dynamically stable provided 
that it satisfies (11361) . This means that we can "miss" stable solutions if we just use 
the optimization problem (11371) to construct stable stationary solutions of the Vlasov 
equation of the form / = /(e) with /'(e) < 0. The criterion (11371) determines a subclass 
of solutions of the maximization problem (11361) . The criterion (11361) is richer and allows 
to construct a larger class of nonlinearly dynamically stable stationary solutions. This is 
similar to a situation of "ensemble inequivalence" in thermodynamics [Tli 3- Indeed, 



(11371) is similar to a criterion of "canonical stability" in a thermodynamical analogy 



Il65i1 . where F is similar to a "free energy". Canonical stability implies microcanonical 
stability but the converse is wrong in case of ensemble inequivalence. 

For a long time, most studies related to the formal nonlinear dynamical stability 
of the Vlasov equation considered the minimization of an Energy-Casimir functional 
E — S at fixed mass (problem with one constraint) [85]. This Energy-Casimir method 
was first introduced by Arnold in two-dimenional hydrodynamics for the Euler 
equation. This corresponds to the "canonical" criterion (11371) . It only provides a sufficient 
condition of formal nonlinear dynamical stability. Recently, Ellis et al. [|4lll introduced 
a refined stability criterion for the 2D Euler equation in the form the "microcanonical" 
criterion (11361) (problem with two constraints). In [|42ll . we introduced a similar criterion 
to study the formal nonlinear dynamical stability of stellar systems with respect to the 
Vlasov-Poisson system. Explicit examples of "ensemble inequivalence" in the context 
of the nonlinear dynamical stability for the 2D Euler and Vlasov equations, where the 



criteria (11361) and (11371) do not coincide, were explicitly constructed by Ellis et al. 114 ih 



for geophysical flows and by Chavanis [|42|1 for stellar polytropes. In the astrophysical 



context, it was shown [|42ll that "ensemble inequivalence" leads to a nonlinear version of 
the Antonov first law (see Sec. 11). 

Example: the extremization of the Tsallis functional 



Sq[f] = -^j{f-f)dTdy, 



at fixed mass and energy leads to distribution functions of the form 



(138) 



/(r,v) 



11 - ^-^ -£ 
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(139) 



where jj. and /3 are Lagrange multipliers determined by M and E fsv']. These DF are 
particular stationary solutions of the Vlasov equation called stellar polytropes in astro- 
physics [54]. Furthermore, if the DF maximizes the Tsallis functional at fixed mass and 
energy, then it is formally nonlinearly dynamically stable with respect to the Vlasov 
equation according to (11361) . In this context, the Tsallis functional (11381) is a particu- 
lar Casimir functional of the form (11351) . not a generalized entropy. Its maximization at 
fixed mass and energy forms a criterion of nonlinear dynamical stability for the Vlasov 
equation, not a criterion of generalized thermodynamical stability in the microcanonical 
ensemble. This dynamical interpretation of the Tsallis functional related to the Vlasov 
equation was given by Chavanis & Sire 11881] . It differs from the thermodynamical in- 
terpretation given by Taruya & Sakagami [|79|] in terms of generalized thermodynamics. 
On the other hand, the minimization of the Tsallis functional Fq = E — TSq at fixed 
mass forms a sufficient criterion of nonlinear dynamical stability for the Vlasov equa- 
tion, not a criterion of generalized thermodynamical stability in the canonical ensemble. 
In the present context, the resemblence with a generalized thermodynamical formalism 
is effective and is the mark of a thermodynamical analogy n65\\ . Note that the preceding 
discussion also applies to the Boltzmann distribution, corresponding to ^ = 1, which is 
also a particular solution of the Vlasov equation. 

Numerical algorithms: A relaxation equation has been proposed to solve the maxi- 
mization problem (11361) . It is obtained from a Maximum Entropy Production Principle 
(MEPP), exploiting the thermodynamical analogy, and has the form of a generalized 
mean field Kramers equation [|650. It can serve as a numerical algorithm to determine 
a nonlinearly dynamically stable stationary solution of the Vlasov equation, specified 
by the convex function C(/), for given E and M. The relaxation equation solving the 
"microcanonical" problem (11361) is 
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This equation conserves the mass M, the energy E (through the time dependent Lagrange 
multiplier /3(?)) and monotonically increases the 5-functional: 5 > (//-theorem). Fur- 
thermore, 5 = iff / is a steady solution of Eq. (11401 ). Therefore, S[f] is the Lyapunov 
functional of the relaxation equation (I140I) - (I141I) . It results that /(r, v) is a linearly dy- 
namically stable stationary solution of Eq. (I140I) - (I14H) iff it is a (local) maximum of S at 
fixed E, M. Minima or saddle points of S are dynamically unstable. Furthermore, if S is 
bounded from above (within the previous constraints), /(r, v, t) will relax towards a (lo- 
cal) maximum of S at fixed E, M for t ^ +oo (if there exists several local maxima of S, 
the selection will depend on a complicated notion of basin of attraction). Therefore, Eq. 
(|140I) - (I14H) can serve as a numerical algorithm to solve the "microcanonical" problem 
(|136l) and compute a nonlinearly dynamically stable stationary solution of the Vlasov 
equation, specified by C, for given E and M. Note that the generalized Vlasov-Landau 



(GVL) equation 
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introduced in ll89ll has the same properties as the relaxation equation (I140I) - (I141I) . In par- 
ticular, /(r, v) is a linearly dynamically stable stationary solution of the GVL equation 
iff it is a (local) maximum of S at fixed E, M. Minima or saddle points of S are dynam- 
ically unstable. Therefore, the GVL equation can also provide a numerical algorithm 
to compute nonlinearly dynamically stable stationary solutions of the Vlasov equation, 
specified by C, for given E and M. We note that /(r, v) is a linearly dynamically sta- 
ble stationary solution of the generalized Vlasov-Landau equation iff it is a formally 
nonlinearly dynamically stable stationary solution of the Vlasov equation. 

If we fix the inverse temperature /3 instead of the energy in Eq. (11401) . we get a 
relaxation equation 



df df df d ,^ 
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solving the "canonical" problem (11371) . It is called the generalized Vlasov-Kramers 
(GVK) equation. This equation conserves the mass M and monotonically decreases 
the F-functional: F < 0. Furthermore, F = iff / is a steady solution of Eq. (11431) . 
Therefore, F[f] is the Lyapunov functional of the GVK equation. It results that /(r, v) 
is a linearly dynamically stable stationary solution of the GVK equation iff it is a (local) 
minimum of F at fixed M. Maxima or saddle points of F are dynamically unstable. 
Furthermore, if F is bounded from below (within the previous constraints), /(r, v, t) will 
relax towards a (local) minimum of F at fixed M. Therefore, Eq. (11431) can serve as a 
numerical algorithm to solve the "canonical problem" (11371) and determine a subclass of 
nonlinearly dynamically stable stationary solutions of the Vlasov equation, specified by 
C, since (11371) =>■ (11361) . We note that a linearly dynamically stable stationary solution 
of the generalized Vlasov-Kramers equation is ( i) a formally nonlinearly dynamically 
stable stationary solution of the Vlasov equation ( ii) a linearly dynamically stable 
stationary solution of the generalized Vlasov-Landau equation (but the reciprocal is 
wrong in case of ensemble inequivalence). 

We can also use the preceding relaxation equations to solve the optimization prob- 
lems (II 181) and (II 191) of Sec. 5. If we view / as the coarse-grained DF / and if we take 
C(/) of the form (II 171) . a linearly dynamically stable stationary solution of the relax- 
ation equation (I140l) - (|141|) is a Lynden-Bell statistical equilibrium state according to the 
sufficient condition (II 181) =^ (|9T| ). Similarly, a linearly dynamically stable stationary 
solution of the generalized Vlasov-Landau equation is a Lynden-Bell statistical equi- 
librium state (but the reciprocal is wrong in case of ensemble inequivalence). On the 
other hand, a linearly dynamically stable stationary solution of the relaxation equation 
(11431) is a Lynden-Bell statistical equilibrium state according to the sufficient condition 
(II 191 ) =^ (|9T| ). Therefore, a linearly dynamically stable stationary solution of the gen- 
eralized Vlasov-Kramers equation is a Lynden-Bell statistical equilibrium state (but the 
reciprocal is wrong in case of ensemble inequivalence). 



11. THE NONLINEAR ANTONOV FIRST LAW 



It was first realized by Antonov [90] in astrophysics, that there exists deep connections 
between the dynamical stability of steady solutions of the Vlasov equation (|68l) of the 
form / = /(e) with /'(e) < and the dynamical stability of steady solutions of the Euler 
equations 

^ + V-(pu)=0, (144) 

^ + (u-V)u= --Vp-V4), (145) 
p 

with a barotropic equation of state p = p{p) (as usual 't'{r,t) = J u{\r — r'\)p{r' ,t)dr' is 
the mean field potential). The Euler equations conserve the mass M[p] = / pdr and the 
energy 

^[p,u] =Jp J'' E^dp'dr+y p^dr+y pu^dr, (146) 

including the internal energy, the potential energy and the kinetic energy of the mean 
motion. It results that the minimization problem 

min {W[p,u\ \ M[p]=M}, (147) 

P 1^ 

determines a steady state of the Euler equation that is formally nonlinearly dynamically 
stable. Writing the first order variations as dW — adM = 0, we get u = and 

PiEldp' = (148) 
J p' 

This leads to the condition of hydrostatic equilibrium 

Vp + pV$ = 0. (149) 

We also note that p =p(^) and that p'{p)/p = -\/p'{^) so thatp(4>) is monotonically 
decreasing (since p'{p) > in cases of physical interest). 

Let us now return to the Vlasov equation. We have seen that the minimization prob- 
lem (11371) provides a sufficient condition of formal nonlinear dynamical stability. In 
order to solve this minimization problem, we can proceed in two steps (we consider 
here the global minimization problem. The local minimization problem is treated in the 
Appendix). First step: we determine the distribution /i(r,v) which minimizes F[f] at 
fixed density profile p(r) = / fd\. This gives a distribution /i[p(r),v] depending on 
p(r) and v. Substituting this distribution in the functional F[f], we obtain a functional 
F[p] = F[fi] of the density. Second step: we determine the density p*(r) which mini- 
mizes F[p] at fixed mass M[p] = M. Finally, we have /*(r, v) = /i [p*(r), v]. Let us be 
more explicit. If we fix the density profile p(r), the potential energy W[p] is automat- 
ically determined. Therefore, minimizing F[f] =£[/] — TS[f] at fixed density profile 



is equivalent to minimizing F[f] = K[f] — TS[f] at fixed density profile, where K[f] = 
J J fv^drdy is the kinetic energy. The distribution fi (r, v) that extremizes F [/] with the 
constraint / fd\ = p(r) satisfies the first order variations 5F + /A(r)5 / fd\dr = 0, 
where A (r) is a Lagrange multiplier. This leads to 



fi=F{l5 



| + A(r) 



(150) 



where F(jc) = (C') ^ (— x) and X (r) is determined by p (r) by writing p = J fid\. Since 

d^F = -T5^S =\t j C"{fx){5ffdrdy > 0, (151) 

this is a minimum of F[f] at fixed density profile. According to Eq. (11501) . the density 
p = / f\dy = p[A(r)] and the pressure p = \ I fi^^dy — p[X{r)] are functions of A(r). 
Eliminating A (r) between these two expressions we obtain a barotropic equation of state 
p = p{p). Then, after simple calculations (see Appendix B of ll42ll ). we can show that 
the functional F[p] =F[fi] is given by 



F[p] =jpf ^dp'dv+^- 1 p^dr. (152) 



Therefore, we conclude that /*(r,v) = /i[p*(r),v] solves the minimization problem 
(11371 ) iff p*(r) solves the minimization problem 

mm {F[p] I M[p]=M}. (153) 

Now we observe that the solutions of the minimization problems (11471) and (11531) coin- 
cide. Therefore, we conclude that /*(r, v) = /i[p*(r),v] solves the minimization prob- 
lem (11371) iff p*(r) solves the minimization problem (11471) . On the other hand, since 
the ensemble of solutions of (11371) is included in the ensemble of solutions of (11361) we 
conclude that : /*(r,v) = /i[p*(r),v] solves the maximization problem (11361) if p*(r) 
solves the minimization problem ( 11471) . but the reciprocal is wrong in case of ensemble 
inequivalence. In conclusion, we find that a DF / = /(g) with f'{£) < is a nonlinearly 
dynamically stable steady state of the Vlasov equation (kinetic system) if the corre- 
sponding density profile p(r) is a nonlinearly dynamically stable steady state of the 
Euler equation (fluid system), but the reciprocal is wrong. In astrophysics, this forms the 
nonlinear Antonov first law [42]: "a stellar system with / = /(e) and /'(e) < is a non- 
linearly dynamically stable steady state of the Vlasov-Poisson system if the correspond- 
ing barotropic star is a nonlinearly d yna mically stable steady state of the Euler-Poisson 
system (but the reciprocal is wrong)."[!jThis can be seen as a manifestation of "ensemble 



inequivalence" in a thermodynamical analogy [|42|] . For example, complete polytropes 



^ ' This law was proven by Antonov ll90l] for the problem of linear dynamical stability and extended by 
Chavanis II42II to the problem of formal nonlinear dynamical stability. 



with index 3 < n < 5 satisfy the "microcanonical" criterion (11361) but not the "canon- 
ical" criterion (I137D <^ (I153D (I147I ). Thus, stellar polytropes with index 3 < n < 5 
are nonlinearly dynamically stable with respect to the Vlasov-Poisson system while the 
corresponding gaseous polytropes are unstable with respect to the Euler-Poisson system 



Example: When S[f] is the Tsallis functional (I138|) . the corresponding barotropic gas 
is a gaseous polytrope with an equation of state fi88i1 : 

p = Kp^, 7=1 + -, n = ^ + ^. (154) 

n q—l 2 

The functional (I152|) is of the form 

^^P^ = ^J iP'-p)dr+\J P^dr. (155) 

This can be viewed as a Tsallis "free energy" functional Fy[p] = W[p] — KSylp] with 
index 7 where W is the (potential) energy and K plays the role of a "polytropic temper- 
ature" (recall again that these analogies with thermodynamics are purely formal in the 
present dynamical context). It can be compared with the Tsallis "free energy" functional 
Fg [f] =E[f] — TSq [/] in phase space (see Sec. 10). The steady density distribution (11481) 
is of the form 



Ky 



r-i 

(156) 



which is morphologically similar to Eq. (11391 ). Therefore, a Tsallis distribution fq{e) in 
phase space yields a Tsallis distribution Py{^) in position space [[88|] . 

Numerical algorithm: A relaxation equation has been proposed to solve the mini- 



mization problem (11531) . It is called the generalized Smoluchowski (GS) equation [|65|] . 
It can serve as a numerical algorithm to determine nonlinearly dynamically stable sta- 
tionary solutions of the Euler equations, specified by p{p), for given M. It is written 
as 

|(Vp + pV4)) 



dt 



(157) 



This equation conserves the mass M and monotonically increases the F-functional: 
F > (//-theorem). Furthermore, /" = iff p is a steady solution of (11571) . Therefore, 
F [p] is the Lyapunov functional of the GS equation (|157l) . It results that p (r) is a linearly 
dynamically stable stationary solution of the generalized Smoluchowski equation iff it 
is a (local) minimum of F at fixed M. Maxima or saddle points of F are dynamically 
unstable. Furthermore, if F is bounded from below (within the previous constraint), 
p(r, f) will relax towards a (local) minimum of F at fixed M iox t ^ +0°. Therefore, 
Eq. (|157l) can serve as a numerical algorithm to solve the "canonical" problem (11531) 
and compute a nonlinearly dynamically stable stationary solution of the Euler equation, 
specified by p{p), for given M since (11531) (11471) . We note that p(r) is a linearly 
dynamically stable stationary solution of the generalized Smoluchowski equation iff it 



is a formally nonlinearly dynamically stable stationary solution of the barotropic Euler 
equation. There are other corrolaries to that result. Since (11371) ^ (11531 ) we conclude that 
/(r,v) is linearly dynamically stable with respect to the generalized Vlasov-Kramers 
equation i[143\l iff the corresponding density p (r) is linearly dynamically stable with 
respect to the generalized Smoluchowski equation A157\) . On the other hand, according 
to the implications (fT53]) ^ (fT47l) ^ (fT37l) ([1361) and (fT53l) ^ (fT37l) ^ (fTOSl) ^ 
(11071) ^ ( |9T| ). we conclude that a linearly dynamically stable stationary solution of 
the generalized Smoluchowski equation determines ( i) a nonlinearly dynamically stable 
stationary solution of the Vlasov equation ( ii) a linearly dynamically stable stationary 
solution of the generalized Vlasov-Landau equation Iil42\l (Hi) a Lynden-Bell statistical 
equilibrium state (but the reciprocal is wrong in case of ensemble inequivalence). 

12. SELECTIVE DECAY PRINCIPLE 

Let us conclude this paper by some phenomenological considerations. The results of 
Sec. 9 suggest a notion of generalized selective decay principle lt36l [67ll : among all 
invariants of the coUisionless dynamics, the generalized //-functions (fragile constraints) 
increase (— // decrease) on the coarse-grained scale while the mass and the energy 
(robust constraints) are approximately conserve According to this principle, we may 
expect (phenomenologically) that the QSS reached by the system as a result of violent 
relaxation will maximize a certain //-function called H* [/] (non-universal) at fixed mass 
M and energy E. Therefore, it is expected to satisfy a problem of the form 

max {//*[/] I E[f]=E,M[f]=M}. (Phenomenological) (158) 

/ 

As we have seen, this maximization problem determines a distribution function / = /(e) 

with / (e) < which is a stationary solution of the Vlasov equation (recall that our 
argument applies to the coarse-grained distribution). In general, the //-function H*[f\ 
that is effectively maximized by the system as a result of violent relaxation (if it really 
maximizes an //-function!) is difficult to predict. It depends on the initial conditions and 
on the efficiency of mixing. If mixing is complete (ergodicity), the statistical theory of 
Lynden-Bell shows that the coarse-grained DF (l84l) extremizes a functional of the form 
(|95])- (I102I) at fixed mass and energy. Although we cannot conclude that this extremum is 
always a maximum because of the problem of ensemble inequivalence reported in Sec. 5, 
it is likely however that in many cases the coarse-grained DF predicted by the statistical 
theory of Lynden-Bell maximizes the generalized entropy (II 14|) - (ll 171) at fixed mass and 
energy. Therefore, we conclude that when mixing is complete H* = S, where S is given 



A selective decay principle was first introduced in two-dimensional turbulence, for the 2D Navier- 
Stokes equation, based on different arguments | [9ll] . It states that, in the presence of a small viscosity 
V ^ 0, the energy of the flow is approximately conserved while the enstrophy decays. This leads to a 
phenomenological minimum enstrophy principle where the "equilibrium" flow minimizes the enstrophy 
at fixed circulation and energy. Our selective decay principle, which also applies to the (inviscid) 2D 
Euler equation ll36ll . is more general (the enstrophy is a particular //-function) and uses the notion of 
coarse-graining rather than viscosity or other source of dissipation. 



by Eqs. (II 14l) - (ll 171) . In other words, apart from the problem of ensemble inequivalence 
which demands further investigation, the statistical theory of Lynden-Bell justifies the 
generalized selective decay principle (11581) and gives the expression of H* [/] , depending 
on the initial condition. We again emphasize the importance of the notion of coarse- 
graining: the functionals H[f] calculated with the fine-grained DF are conserved while 
the //-functions //[/] calculated with the coarse-grained DF increase. Furthermore, the 
particular //-function //*[/] = ^f/j (whose form depends on the initial conditions) is 
maximum at statistical equilibrium while E[f\ and M[f\ are approximately conserved. 
Therefore, although the Lynden-Bell theory takes into account the conservation of the 
Casimirs //[/], it leads to a coarse-grained distribution which maximizes a functional 
//*[/] at fixed E and M. On the other hand, if mixing is incomplete (see Sec. 6), the 
coarse-grained DF may still maximize a certain generalized //-function at fixed mass and 
energy but //*[/] and /(e) can take forms that are not compatible with the expressions 
(|1 14l) - (ll 171) and (l83l) derived in the statistical approach. In case of incomplete relaxation, 
the prediction of this functional H* [/] seems out-of-reach because its form depends on 
the efficiency of mixing which is not known a priori. In particular, there is no reason 
why //* [/] should always (universaly) be the Tsallis functional. However, it has been 
observed in several occasions 11771. 1 171.1411 that the Tsallis distributions are "attractors" of 
the solutions of the (coarse-grained) Vlasov equation for some initial conditions. 

We now observe that the criterion of formal nonlinear dynamical stability (11361) is 
remarkably consistent with the phenomenological selective decay principle of violent 
relaxation (11581) provided that we interprete / as the coarse-grained DF. This coinci- 
dence looks surprising at first sight. A priori, the optimization problems (11361) and (11581) 
are completely different: the problem (11361) explicitly uses the fact that the functional 
//[/] is conserved by the Vlasov equation while the problem (11581) uses the fact that the 
functional //[/] calculated with the coarse-grained DF increases during violent relax- 
ation. In fact, the problem (11361) indicates that if a. DF is a maximum of H at fixed mass 
and energy, then it is nonlinearly dynamically stable with respect to the Vlasov equation 
while the phenomenology of violent relaxation leading to the problem (11581) explains 
how, starting from an unstable initial condition, a DF can possibly reach a maximum of 
H at fixed mass and energy (through mixing) although H is rigorously conserved by the 
Vlasov equation. Indeed, during mixing the coarse-grained DF is not conserved by the 
Vlasov equation {Df /Dt ^ where D/Dt h the material derivative in phase space) and 
the //-functions //[/] increase. Once it has mixed, the coarse-grained DF satisfies the 
Vlasov equation Df /Dt = so that the Casimirs calculated with the coarse-grained DF 
are now conserved //[/] = 0, as well as the mass and the energy. Indeed, after mixing 
the coarse-grained field must be viewed as a new fine-grained field in a possibly further 
evolution. Now, the phenomenology of violent relaxation (11581) suggests that /(r,v,?) 
will be brought to a maximum /^(r, v) of a certain //-function //*. This is proven by 
the theory of Lynden-Bell in case of efficient mixing (up to the problem of ensemble in- 
equivalence) and this may remain true even in case of incomplete relaxation. Since //[/], 
E[f] and M[f] are conserved by the Vlasov equation after mixing, and since /^(r, v) is 
a maximum of //* [/] at fixed mass and energy, then it is a formally nonlinearly dynam- 
ically stable steady state of the Vlasov equation according to the criterion (11361) . 

Let us make a final comment. The maximization (under appropriate constraints) of 
the Lynden-Bell entropy (|79|) . of the Tsallis entropy (11211) . of an H-function (11281) or 



of a convex Casimir (11351) leads to a distribution function of the form / = /(e) with 
/ (e) < depending only on the energy. In astrophysics, these DF can only describe 
spherical stellar systems (and even a sub-class of them) [1540. In reality, stellar systems 
are not spherical and their distribution functions are not function of the energy alone 
0. Indeed, according to the Jeans theorem ^5^, there exists more general stationary 
solutions of the Vlasov equation which depend on other integrals of motion. This 
indicates that the structure of the final state of a coUisionless stellar system depends on 
its dynamical evolution in a complicated manner. An important problem in astrophysics 
is therefore to find the form of distribution function appropriate to real galaxies. Simple 
concepts based on entropies, //-functions or formal nonlinear dynamical stability are 
not sufficient to understand the structure of real galaxies. This is particularly deceptive. 
However, conceptually, the theory of violent relaxation is important to explain how a 
coUisionless stellar system reaches a steady state (QSS). This is due to phase mixing 
in phase space. The coarse-grained DF /(r,v,?) reaches a steady state /g55(r,v) in 
a few dynamical times while the fine-grained distribution function /(r, v,f) develops 
filaments at smaller and smaller scales and is never steady (presumably). Since this 
mixing process is very complex, the resulting structure fgssi^i'^) should be extremely 
robust and should be therefore a nonlinearly dynamically stable stationary solution of 
the Vlasov equation. Thus, the theory of incomplete violent relaxation explains how 
coUisionless stellar systems can be trapped in nonlinearly dynamically stable (robust) 
stationary solutions of the Vlasov equation on the coarse-grained scale. 



13. SUMMARY: THE DIFFERENT FUNCTIONALS 

We would like to again emphasize the distinction between entropies and //-functions 



[|36|1 . An entropy is a quantity which is proportional to the logarithm of the disor- 
der, where the disorder is equal to the number of microstates consistent with a given 
macrostate. This is how the Lynden-Bell entropy (|79|) has been defined. Tsallis en- 
tropy (11211) could be considered as a generalization of this definition in the case where 
the phase-space has a complex structure so that the evolution is non-ergodic and the 
potentially accessible microstates are not equiprobable. In each case, the entropy is 
a functional of the probability p(r,v, r]) and the maximization of these entropies at 
fixed mass, energy and Casimirs is a condition of thermodynamical stability. The "gen- 
eralized entropies" (II 14l) - (ll 171) defined in Sec. 5 can be regarded as entropies which 
are proportional to the logarithm of the number of microstates consistent both with a 
given macrostate and with the constraints imposed by the Vlasov equation (Casimirs). 
Their functional form depends on the initial condition. They are defined on a projection 
space (/-space) where a macrostate is defined by the specification of /(r,v) instead of 
p(r, V, T]). On the other hand, the //-functions do not have a statistical origin. They are 
just arbitrary functionals of the coarse-grained distribution /(r,v,?) of the form (11281) . 



This shows that the phenomenological principle ( I158l l is not always valid. Indeed, real (non-spherical) 
galaxies do not maximize an H function at fixed mass and energy although they probably result from a 
process of violent relaxation. 



They increase during violent relaxation and they are useful to characterize the degree 
of mixing of a coUisionless system [40]. According to the phenomenological selective 
decay principle (11581) . the QSS reached after violent relaxation is expected to maximize 
a certain //-function (non-universal) at fixed mass and energy. According to criterion 
(|136l) . this principle ensures that the QSS is nonlinearly dynamically stable with respect 
to the Vlasov equation. This is to be expected since the QSS results from a turbulent 
mixing which makes it very robust. In case of complete violent relaxation, the selective 
decay principle (11581) is justified by the statistical theory of Lynden-Bell and H* = S 
where S is given by Eqs. (II 14l) - (ll 171) . In case of incomplete relaxation, the //-function 
H* and the distribution function /q^^ of the QSS are different from the Lynden-Bell 
prediction. In some cases of incomplete relaxation [|77l . the QSS is close to 

a Tsallis distribution and H* is close to the Tsallis functional, but this is not universaly 
true. 

Examples: Tsallis functional Sq[p] expressed in terms of p(r,v, r]) is a generalized 
entropy. Tsallis functional Sq[f] expressed in terms of /(r,v) is (i) a generalized H- 
function (ii) a particular case of generalized entropy Sq[p] for two levels / G {0, 170} 
in the dilute limit / <ti rjo. Tsallis functional Sq[f] expressed in terms of /(r,v) is a 
particular convex Casimir whose maximization at fixed mass and energy provides a 
condition of formal nonlinear dynamical stability for the Vlasov equation (this leads 
to stellar polytropes in astrophysics). 



14. CONCLUSION 

In this paper, we have studied the dynamics and thermodynamics of systems with weak 
long-range interactions. In particular, we have stressed the physical interpretation of the 
different functional appearing in the analysis. In our discussion, we have exclusively 
considered the case of isolated Hamiltonian systems described by the microcanonical 
ensemble. In this concluding section, we would like to briefly discuss the physics of 
related systems. 

It is interesting to compare the microcanonical evolution of Hamiltonian systems with 
long-range interactions to the canonical evolution of Brownian systems with long-range 
interactions f^. In the case of Brownian particles in interaction, the microscopic equa- 
tions of motion consist in N coupled Langevin equations including a friction force and 
a stochastic force in addition to the long-range coupling. These additional terms can 
take into account short-range interactions with an external medium playing the role of 
a thermal bath. These Brownian systems are dissipative and they are described by the 
canonical ensemble. In the thermodynamic limit N +co defined in Sec. 2.2., the mean 
field approximation is exact and the evolution of the distribution function /(r, v, t) of the 
Brownian system is governed by the mean field Kramers equation (decreasing mono- 
tonically the Boltzmann free energy) This is the canonical counterpart of the Vlasov 
equation for Hamiltonian systems. This shows that, for +00, the Brownian system 
tends to relax towards the mean field Boltzmann distribution on a typical timescale 
where ^ is the friction coefficient. In the strong friction limit ^ +0°, the velocity distri- 
bution is close to Maxwellian and the evolution of the spatial density p (r, t) is governed 
by the mean field Smoluchowski equation [3]. So far, two types of Brownian systems 



with weak long-range interactions have been studied: (i) self-gravitating Brownian par- 
ticles (or bacterial populations) where the particles interact via a Newtonian potential 
[I92I1 (ii) the Brownian Mean Field (BMF) model where the particles interact via a cosine 
potential iSBllol. 

The previously described Hamiltonian and Brownian systems [0] are "simple" sys- 
tems (this does not mean that their study is trivial!) because everything is contained in 
the A'^-body Hamiltonian or Langevin equations. Recently, some authors have considered 
the case of "complex" systems where the microscopic dynamics is not perfectly known 
or can be biased with respect to the ordinary one because of some microscopic con- 
straints whose influence is difficult to formalize. It has been proposed that such systems 
could be described by an effective generalized thermodynamics (E.G.T.). The Tsallis 
n3l\\ entropy has become very popular to describe complex systems, but other forms of 



generalized entropies can also be considered [|65l. \94\ . In that context, generalized ki 



netic equations, associated with generalized forms of entropic functionals, have been 
introduced either in the microcanonical ensemble or in the canonical ensemble. In the 



first case, they have the form of generalized Boltzmann or Landau equations [|95L |9f 
and they conserve the energy and the mass. In the second case, they have the form of 
generalized Fokker-Planck equations, like the generalized Kramers and Smoluchowski 
equations [|95L Bli \94\ . If the particles interact via weak long-range forces, we obtain 
generalized mean field kinetic equations 11651 19411. The general study of these equations, 
which combine both long-range interactions and generalized thermodynamics, is very 
rich and can lead to a wide diversity of phase transitions and blow-up phenomena. For 
example, generalized mean field Fokker-Planck equations appear in the physics of bac- 
terial populations driven by chemotaxis (Keller-Segel models) [98i|93]- More generally, 
long-range interactions and generalized thermodynamics can have applications in sev- 
eral domains of physics, astrophysics and biology [[99|l . 



APPENDIX: SECOND VARIATIONS AND LOCAL STABILITY 

EQUIVALENCE 

The results of Sec. 5 show that the variational problems (11071) and (II 181) are equiva- 
lent for global maximization. In this Appendix, adapting the procedure developed by 



Bouchet II66II for the 2D Euler equation, we show that they are also equivalent for lo- 
cal maximization. A critical point of (I107|) is a local maximum of S), [p] at fixed mass, 
energy and normalization iff 

5^F[dp] = T J ^^^drdydr] + 5fd^drd\ > 0, (159) 

for all perturbations 5p that conserve mass, energy and normalization at first order. On 
the other hand, a critical point of (II 181 ) is a local maximum of S[f] at fixed mass and 
energy iff 

5^F[f] = \t j C"{f){5ffdrdy + y 5f5^drdy > 0, (160) 



for all perturbations 5f that conserve mass and energy at first order. To make the 
connection between the second order variations (11591 ) and (1160k the idea is to project 
the perturbation 8p on a suitable space (to be defined) and write 8p = dp\\ + 5p^ where 
5p^ is the orthogonal perturbation [|66l]. The perturbation 5p must satisfy / 5pdr\ = 
and / 5pr\dr\ = 5f. We impose the same constraints on 5p||, i.e. / 8p\\dr] = and 

/ Sp^^rjdr} = 5f (we shall see that this leads naturally to the orthogonality of 5p|| and 
8p±). Then, writing 5p|| = 5fgp where ^ is a function to be determined, we must have 
/ gpdrj = and / gprjdrj — 1. We look for a solution of the form g = a + brj and find 
that a + bf = and af + bp — 1 leading io a = — ///2 and b = I//2 where /2 is the 
centered local variance defined in Eq. (f87l) . Therefore, we write 

5p^^{r]-J)p + dp^, (161) 

72 

where dp^ ensures that all the perturbations are considered. By construction, we have 
/ 5p±dr} = and / 5p±r}dr} = 0. Therefore, 5p± is orthogonal to 5p\\ in the sense that 



Sp^dr^ oc I {r]-f)5p^dr] = 0. (162) 



Then, we readily obtain 



p J P J P 

= /(^,, + (f!=/■(^,,+c"(7)(^7)^ (163) 

■I P fl .1 P 

where we have used identity (11031) . Finally, the second variations (11591) can be rewritten 

5^F[dp] = T j i^^drd\dri + d^F[5f]. (164) 

If 5^F[5f] > for all perturbations 5f that conserve mass and energy at first order, then 
d^F[5p] > for all perturbations 6p that conserve mass, energy and normalization at 
first order. Alternatively, if there exists a perturbation 5f^ such that 5^F[df^] < 0, by 
taking 5p* in the form (11611) with df = df^ and 5p^ = 0, we get 5^F[5p^] < 0. We 
conclude that p (r, v, 77 ) is a local maximum of Sj^ [p] at fixed E, M and normalization iff 
7(r, v) is a local maximum of S[f] at fixed E, M. Thus: (11071) <^ (11181) locally. 

We now adapt the same procedure to show that the variational problems (11371) and 
(11531) are equivalent for local minimization (we have already shown in Sec. 1 1 that they 
are equivalent for global minimization). A critical point of (11371) is a local minimum of 
F[f] at fixed mass iff 

d^F[f] = ^T lc"{f){5ffdrdy + y 5pd^dr>0, (165) 



for all perturbations 5f that conserve mass at first order. On the other hand, a critical 
point of (11531 ) is a local minimum of F[p] at fixed mass iff 

d^F[p] = J ^^{5pfdr+^ J 5p5mr > 0, (166) 

for all perturbations 5p that conserve mass. We can always write the perturbation in the 
form 

where 5f^ ensures that all the perturbations are considered. By construction, we have 
/ 5fdy = J df\\dy = 5p so that / 5f^d\ = 0. Therefore, 5f^ is orthogonal to in 
the sense that 

J C'\f)5f\\5f^dy oc j 5f^dy = 0. (168) 
Then, we readily obtain 

J C"{f){5ffdy = j C"{f){5f^fdy + j C"{f){5f\\fdy 

= [c"{f){5f^)^dy + ^^. (169) 

J c"if) 

Now, a critical point of (11371) is of the form / = F(/3£ + a) = /(e) with £ = v^/2 + ^{r) . 
If we introduce the density p = [ fdy and the pressure P = f fv^dy, we find that p = 
p(4>) and p = p{^). Eliminating 4>(r) between these relations, we obtain a barotropic 
equation of state p = p{p), the same as in Sec. 11. We easily establish that 

Vp = -, f ^v^dy = -y^ I f(e)v^dy 
djdr d J ^ ' 

= -V$ [^.ydy= -V$ [fdy = -pV4>, (170) 
d J dy J 

so that the condition / = /(e) is equivalent to the condition of hydrostatic equilibrium 
Vp — — pV$. This implies p'{^) = — p(^) and 

p'(p) 1 1 T 

= (171) 



p'(4.) ff'{e)dy f 



c"{f) 



where we have used the identity C"{f) = —j5/f'{£) resulting from C'{f) = — /3£ — a. 
Combining Eqs. (11691) and (11711) we finally obtain 

d^F[df] = \t j C"{f){8f^fdrdy + 5^F[5p]. {Ill) 

From this equality, we conclude that /(r, v) is a local minimum of F[f] at fixed mass iff 
p(r) is a local minimum of F[p] at fixed mass. Thus: (11371) <^ (11531) locally. 
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